SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: Gatanaga, T.
Granger, G.A.

(ii) TITLE OF INVENTION: Factors Altering Tumor Necrosis
Factor Receptor Releasing Enzyme Activity

(iii) NUMBER OF SEQUENCES: 154

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: MORRISON & FOERSTER
(B) STREET: 755 PAGE MILL ROAD
(C) CITY: Palo Alto
(D) STATE: CA
(E) COUNTRY: USA
(F) ZIP: 94304-1018



(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Diskette
(B) COMPUTER: IBM Compatible
(C) OPERATING SYSTEM: Windows
(D) SOFTWARE: FastSEQ for Windows Version 2.0b



(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: USSN 09/081,385
(B) FILING DATE: 014-NOV-1998

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME:
(B) REGISTRATION NUMBER:
(C) REFERENCE/DOCKET NUMBER: 22000-20577.21

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 650-813-5600
(B) TELEFAX: 650-494-0792
(C) TELEX: 706141

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4047 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 



AAGCTTTTTG CTTTCCTTCC CCGGGAAAGG CCGGGGCCAG AGACCCGCAC TCGGACCAGG 60
CGGGGGCTGC GGGGCCAGAG TGGGCTGGGG AGGGCTGGGA GGGCGTCTGG GGCCGGCTCC 120
TCCAGGCTGG GGGCCGCCAG CTCCGGGAAG GCAGTCCTGG CCTGCGGATG GGGCCGCGCG 180
TGGGGCCCGG CGGGGCGGCC TCGGGAGGCG TCCAGGCTGG GGGAGCGGGA GWGCGGCCG 240
TGCGGGCGCC AGCGCCGTGG GTGGAGGTCG CCGTCCCTCC TGAGGGGCAG CCAGTGCGTT 300
TGGGACCCGG GAGCAGAGCC CGCGCCTCCC CAGCGGCCTC CCCGGGGGTC TCACCGGGTC 360
ACCCGAGAGC GGAGGCCCCG GCTCCGCAGA AACCCGGGGC GGCCGCGGGG AAGCAGCGCC 420
CTCAGGCGTC GGAGGAGCCC CCAGAAGGAC CTCGCGCCTT CCCGCCGGGC TCCGACCGCC 480
TGGGTTCGGT GCGGGACGGC CCAGGCCGCC AGGACCCCCA AGCGCAGCTC AGTCTGCGGG 540
GCACGACCCA GAGGCCAGCA GCAGAGGACG GGGCCGGGGC CGGGAGAGGG CGGGGAGGGC 600
GCTCCTGGGA GGTCAAGGCC AGGGCTAGAC TTTCAGGGTC ATGGCCTGGC CCCTCATCCC 660
CAGGGAGGTG AGGGGGCTCT GTGAGCAGAG GGGGCCCCGG TGGAGAAGGC GCTGCTAGCC 720
AGGGGCGGGG CAGGAGCCCA GGTGGGGACT TAAGGGTGGC TGAAGGGACC CTCAGGCTGC 780
AGGGATAGGG AGGGAAGCTA GGGGTGTGGC TTGGGGAGGT GCTGGGGGAC CGCGGGCGCC 840
CTTTATTCTG AAGCCGAATG TGCTGCCGGA GTCCCCAGTG ACCTAGAAAT CCATTTCAAG 900
ATTTTCAGGA GTTTCAGGTG GAGACAAAGG CCAGGCCCAG GTGAAAATGT GGCAGTGACA 960
GAGTATGGGG TGAGAACCAC GGAGAGAGGA AGTCCCCGAG GCGGATGATG GGACAGAGAG 1020
CGGGGACCAG AATTTTTTAA AACGCATCTG AGATGCGTTT GGCAGACTCA TAGTTGTTTT 1080
CCTTTCACGG AGAAAGTGTG GGCAGAAGCC AGCTCTAAAG CCCAGGCTGC CCAGCCTGCA 1140
CTGGCAGAGC TGACGGAAGG CCAGGGCAGA GCCTTCCCTC CCTGTCACAG ACATGAGCCC 1200
TGGAGATCTG GAATGAGGCA GATGTGCCCA GGGAAAGCTG ATCCGCCCCG ACCCAGGGCC 1260
CCCCGGGTGC CCCTTTGAGC GTGGAATCGT TGCCAGGTCA TGGCTCCCTG CTATCGAACA 1320
CCGGACACGG GTCGTGTGCT GCACCTGGCA GTTGCAGGAC CGACACCCAC AATGCCTTAA 1380
GAGGTGATGA CTGCCTTCCA GGGGCCTGGC TGGCTGACAC TTTGCATGGC TCCTGGAGAA 1440
GAGGGATTGA GTGGAGTCCA CGGGTCATGG CCACGTCCTG GGTGCTGCCT CTGAGGCAGG 1500
GCCCGGCTGG GGTGAGAAGG GGCTGGAGAC AGGTTCCTGC CAGTTCAGCC TCTAACCGGT 1560
GGTCTTCATG CCTAGGAACC CACTGGGGGC TTATGAAACT GCAGGTGGCT GAGTCCTTGC 1620
CATGGGGTCT CTCCTTCAGG AGGTCTGGGT GGGGCCGGAG ACTGTACCCC ACAAAGGGTC 1680
CCAGGTGAGG CGGATGTGGC CTGGCGCTGT GTGGCTCTGG ACCTAGTCCT TGGGCTTGGG 1740
CTGGCGCCCA GGGCCTGGGC TTGAGACAGC TGTGACGCAG GCAAGCCATT TACCCCGTTT 1800
GTGGGGACAT TACATCTTCC TAGCTTGGAA CACACAGGCA GCCAGGGTTG TTATCCACAT 1860
TCCTCCTCCA TGTTCTTCTC TTGAGAACTT TTACCAGGTA TGTCAGGAGC TGGGCTCCAC 1920
CAGGGAGACT CAAGTGGAAA GCCCTCATCC TTGTCCTCCA GGAGAGAGGA AAACCTATGG 1980
TTACAATTCC AGGGACAAGA GCGATGCATG TGAGGTGTGG CAAATCTCAC TGTTCAACTG 2040
GAGAAATCAG AGACAGCTTC CTGGAGGCAG TGACACCTGG ACAGGCTTCT CCACAGGAGG 2100
AAGCGAGTGA GAGAAGCCAA CTGGGATGGA CCCATCATGT AGGGGGAACA GTGCGCGCAG 2160
AACCAACAAC CACCCCCACC CTAGGCCCAG AGCTCACGGA GAGAGCTGGG CCTCTCGGGG 2220
TGACTACATA GTTCCCTGCT GGATCTTAGG TCTTGTCCTT GGGCAGCTCT GCTGAGACCT 2280
CTATGCCTGT TCCAGGCTGC ACCAAGGTTT TGTGACTATT GGTCTGGGGT TGTTTTGCAG 2340
CAACTGAAGT GTTCTGTTGT AAAACAGGCA CTTGATTTGC TGGAAGGAAT GCTGTTTGTT 2400
CTTGCTGCGA CAAACATTGA GCAGCATTTA GTGGGCGGTT TATATCTTGT GGAGTAATGG 2460
GTGTTTTTGA AGTCTGTCCT GGGTACTGCA CATTAAAAGG AATATCATTT TCTGAAACAT 2520
TGCTATTTTC CACACCAGAA ATCATATCCT CTTGCTGGTC CATGTCTGAA GACCTTACAC 2580
GAGAAAGTCT TAATGTAAGT TTAGTAGAGT CCTTGGATGG AGAACTAATT ATATCATACA 2640
TTGCCGCTTT CTCACTCTGC TCTTTTTCAT CCTTGCCTAA TTTCATTTTC TTCTGCTTCT 2700
TTTGTTTTCT TTCTGGAGAA TCTAGCAAGA TATCTGGTGG AACATCTCGA GGTGATGAAC 2760
AAGGTAGAGA CTGAGATTGT AGGATTAAAG GTGGTCTTGA GCCTTTAGGA GTTCCTTCAC 2820
TTCCAGCAGG GGAGCATACT GGCTGTGGAG ATCTCAAGGG AAAAGATGCA GCATTCCTCA 2880
TTGTTGAAGA ATCTCCATCG TCACTACTTA GCCTGTGCAC CATGTGTAGG TAGTCCTCAC 2940
TTGAACCATG TCTAGGATTA TCAGCATGAT GATTAGCTGA ATTGCCAGAC AACGGACCAG 3000
AAACTTTATT ATCATGTATG TTTCTCAAAC CACCTGCAAC AATGGGACTT GATACCGATG 3060
CTTGTTGCAT CTGTGGATGT GTTGTGTAAC TTGAAGGATG GGAATATGGC ATGTATCCTG 3120
CAGGGCTTTG TGGGGCGTAT GGACTAGGCA CTGGGCTATT TTGCTGTGGC ATAAATCTGT 3180
TCCCAGAGCT TGTCTGTGGT GGCACAAACC GGCTGGAGGG GCTATGTGAG ATAGTGGTTT 3240
GTTGATAATT GGAAGATGCA GGACTACTGT GCATGGAATT CTGAGAAAGT TTATACTGAG 3300
ACATCATCAT TCCACTTTGT ACATATCTGT TCTGCATGCT TTTCTCCCTG AAAACATTAG 3360
GACTCCTTGC CAGGACGGCC TGCAACAAGA CTGGTATGTC ACCTTCTGGG TCATCACTGC 3420
CAAGGTTATC TTTCAACTCT ATGTGATCTG TTGATACCTG GTTGAGGCTA TGGACAAGCT 3480
GTGAAACCAA ATTGTCATCC CTACAAGCCA AAAGGCAGTT CACCTCTTCT GCTATTCGTG 3540
CATTAAAGAG AAGGCTCTTT GTAGTTGTAG CAGGTAAAGG AGATGGAAGA GGCAGCTGGT 3600
TCAGGAGGTC TGTGAGACTA GCAATCCCCG CAAGAGTAGT AATGGGGACA TGGGGCATAT 3660
CCCCATTCAT CCTGAATTTC TGGAATGGTG TTGCCTATAA AAGTACTTAG TTCAGGTGCC 3720
AGCTGTCATT ACTTCCCATT TCCCAAACAC TGGGCGAATC GGCGTCTGAA TCCAAGGGGA 3780
GGCCGAGGCC GCTGTGGCGA GAGACTATAA TCCGGGCCGG GAGGGGGGGC GGCTACGGCT 3840
CCTCTTCCGT CTCCTCAGTG CGGGGAACAT GTAGAGCCGG GGGGAGACCA GCCGAGAAGA 3900
CAAATCGTTG CTTCTTCTTC CTCCTCCTCC TCCTTCTCCC ACATAGAAAC ACTCACAAAC 3960
ACCCGACCAC GGGCCCGAGC TACCGGGGGG GCATCGCCGC GGGCCCGGGA ACCAATTCTC 4020
CTGTCGGCGG GGGCGTCCTT TGGATCC 4047



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 739 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GGATCCAAAG GTCAAACTCC CCACCTGGCA CTGTCCCCGG AGCGGGTCGC GCCCGGCCGG 
r^rrrrrCG GGCGCTTGGC GCCAGAAGCG AGAGCCCCTC GGGGCTCGCC CCCCCGCCTC 
S! SKIS ATCAGAGTAG TGGTATTTCA CCGG^GCCC GCAGGGCCGG 
CGGACCCCGC CCCGGGCCCC TCGCGGGGAC ACCGGGGGGG CGCCGGGGGC CTCCCACTTA 
TTCTACACCT CTCATGTCTC TTCACCGTGC CAGACTAGAG TCAAGCTCAA CAGGGTCTTC
TTTCCCCGCT GATTCCGCCA AGCCCGTTCC CTTGGCTGTG GTTTCGCTGG ATAGTAGGTA
GGGACAGTGG GAATCTCGTT CATCCATTCA TGCGCGTCAC TAATTAGATG ACGAGGCATT
TGGCTACCTT AAGAGAGTCA TAGTTACTCC CGCCGTTTAC CCGCGCTTCA TTGAATTTCT
TCACTTTGAC ATTCAGAGCA CTGGGCAGAA ATCACATCGC GTCAACACCC GCCGCGGGCC
TTCGCGATGC TTTGTTTTAA TTAAACAGTC GGATTCCCCT GGTCCGCACC AGTTCTAAGT
CGGCTGCTAG GCGCCGGCCG AAGCGAGGCG CCGCGCGGAA CCGCGGCCCC CGGGGCGGAC 
CCGCGGGGGG GACCGGGCCG CGGCCCCTCC GCCGCCTGCC GCCGCCGCCG CCGCCGCGCG 
CCGAAGAAGA AGGGGGAAA 

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 233 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CAAGAGTGGC GGCCGCAGCA GGCCCCCCGG GTGCCCGGGC CCCCCTCGAG GGGGACAGTG 
CCCCCGCCGC GGGGGCCCCG CGGCGGGCCG CCGCCGGCCC CTGCCGCCCC GACCCTTCTC 
CCCCCGCCGC CGCCCCCACG CGGCGCTCCC CCGGGGAGGG GGGAGGACGG GGAGCGGGGG 
AGAGAGAGAG AGAGAGAGGG CGCGGGGTGG CTCGTGCCGA ATTCAAAAAG CTT 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2998 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

GGATCCAAAG AATTCGGCAC GAGGTAGTCA CGGCTCTTGT CATTGTTGTA CTTGACGTTG 
AGGCTGGTGA GCTTGGAAAA GTCGATGCGC AGCGTGCAGC AGGCGTTGTA GATGTTCTGC 
CCGTCCAGCG ACAGCTTGGC GTGCTGGGCG CTCACGGGGT CCGCATACTG CAGCAGGGCC 
TGGAACTGGT TGTTCTTGGT GAAGGTGATG ATCTTCAACA CTGTGCCGAA CTTGGAGAAA 
ATCTGGTGCA GCACATCCAG GGTCACAGGG TAGAAGAGGT TCTCCACGAT GATCCTGAGC 
ACGGGGCTCT GCCCGGCCAT CGCCATCCCT GCATCCACGG CCGCCGCCGA GGCAGCCAAG 
GCCAGGTTCC CCGACTGGAC CGAGTTCACC GCCTGCAGGG CCGCCTGGGC CCGCGCCTGG 
TTGGGAGAGC TGTCGGTCTT CAGCTCCTTG TGGTTGGAGA ACTGGATGTA ^™GGCTGG 
CCGCGCAGCA CAGGGGTCAC CGAGGTGTAG TAGTTCACCA TGGTATTGGC AGCCTCCTCC 
GTGTTCATCT CGATGAAGGC CTGGTTTTTC CCCTTCAGCA TCAGGAGGTT ^TGACCTTC 
CCAAAGGGCA GCCCCAGGGA GATGACTTCC CCCTCCGTGA CGTCGATGGG ^CTTCCGG 
ATGTGGATCA CTCTAGAGGG GACGCCTGCA CTTCGGCTGT CACCTTTGAA CTTCTTGCTG 
TCATTTCCGT TTGCTGCAGA AGCCGAGTTG CTGCTCATGA TAAACGGTCC GTTAGTGACA 
CWGTAGAGA AAAGCTCGTC AGATCCCCGC TTTGTACCAA CGGCTATATC TGGGACAATG 
CCGTCCATGG CACACAGAGC AGACCCGCGG GGGACGGAGT GGAGGCGCCG GAATCCTGGA 
GCTAGAGCTG CAGATTGAGT TGCTGCGTGA GACGAAGCGC AAGTATGAGA JTGTCCTGCA 
GCTGGGCCGG GCACTGACAG CCCACCTCTA CAGCCTGCTG CAGACCCAGC ATGCACTGGG 
TGATGCCTTT GCTGACCTCA GCCAGAAGTC CCCAGAGCTT CAGGAGGAAT TTGGCTACAA 
TGCAGAGACA CAGAAACTAC TATGCAAGAA TGGGGAAACG CTGCTAGGAG CCGTGAACTT 
CTTTGTCTCT AGCATCAACA CATTGGTCAC CAAGACCATG GAAGACACGC TCATGACTGT 
GAAACAGTAT GAGGCTGCCA GGCTGGAATA TGATGCCTAC CGAACAGACT TAGAGGAGCT 
GAGTCTAGGC CCCCGGGATG CAGGGACACG TGGTCGACTT GAGAGTGCCC ^GCCACTTT 
CCAGGCCCAT CGGGACAAGT ATGAGAAGCT GCGGGGAGAT GTGGCCATCA AGCTCAAGTT 
CCTGGAAGAA AACAAGATCA AGGTGATGCA CAAGCAGCTG CTGCTCTTCC ACAATGCTGT 
GTCCGCCTAC TTTGCTGGGA ACCAGAAACA GCTGGAGCAG ACCCTGCAGC AGTTCAACAT 
CAAGCTGCGG CCTCCAGGAG CTGAGAAACC CTCCTGGCTA GAGGAGCAGT GAGCTGCTCC 
CAGCCCAACT TGGCTATCAA GAAAGACATT GGGAAGGGCA GCCCCAGGGT GTGGGAGATT 
GGACATGGTA CATCCTTTGT CACTTGCCCT CTGGCTTGGG CTCCTTTTTC TGGCTGGGGC 
CTGACACCAG TTTTGCCCAC ATTGCTATGG TGGGAAGAGG GCCTGGAGGC CCAGAAGTTG 
CTG^CCTGTC TATCTTCCTG GCCACAGGGC TTCATTCCCA GATCTTTTCC TTCCACTTCA 

cIgcSIcgg J?Itgacaaa accactccct ggccaatggc atcactcttc aggctggggt 

GTGCTCCCTG ACCAATGACA GAGCCTGAAA ATGCCCTGTC AGCCAATGGC JGCTCTTCTC 

GGACTCCCCT GGGCCAATGA tgttgcgtct aatacccttt gtctctcctc tatgcgtgcc 

CATTGCAGAG AAGGGGACTG GGACCAAAGG GGTGGGGATA ATGGGGAGCC CCATTGCTGG 
CCTTGCATCT GAATAGGCCT ACCCTCACCA TTTATTCACT AATACATTTT ATTTGTGTTC 2100

tcwatttE AMTACCTTT TCATCTTGCT TGATTTTCCT TCAGCTAAAT TAGAAATTTG 2160 

TAGTTTTTCC CCTAAAAAAT TCAATGGCAT TCTTTCTTAT AAATTACATT CTCTGATTTT 2220 

SS CAGCC "cT^G AAATCCATGT GTTCAAAATG CTTGCTCGCA GTTTGCTCCA 2280 

xirrlAftTrr TTGCTTAACC CAAATATCTG AGCAGCAAAT TGAGCTGATC CTTCTGGAGA 2340 

AAGTACGGTT GAACAGCCAA GACCACTGGG TAGTCGAAGA GAAGACCACA CATCCTGMC 2400 

TCCCCAGTCT GGTGTGAGGG GAGGACAGCT GATAACTGGA TATGCAGTGT TCCCAGACAT 2460 

PAPTRGTCCC AAACCATTAC TTCTGCCTGC CACTGCCACA AATACAGTAG GAATGCCATC 2520

c2S?cI?AC TCAGCTTTAA TCCTCAGAGT TTCATCTGGT CCTTTATGCG CAGATGTTAC 2580 

SSIg?™ CATGGAATGC CAAAATTTCC ACAGGCCTTC TTGATTTTTT CACAGTGACC 2640 

MG^S TCAACACTAC AACCCTGCAC TGACTTTCTG ATTTCAAAAG 2700 

CAACTCTACT CTCTCTGCAA CCCACTCAAA GTTTTTCTTT ACCATTTGGA GCCCTTCAGG 2760 

T«yj I pTTpr TTrirrTCCC GATAAGACTG TTTGTCTTTC TGTTGGCTTC GATCTCCTGA 2820 

SgIgT £cSS 5 S AACATCAGCA AGAACAATTT CTTTGGTGGT 2880 

Iaca?£a^I CCAAATTCAA TCTTCATATC AACCAGTGTA CAATTCTGGG GCAACCAGGA 2940 

TAGCCTGTGT AGCATCTCGT GCCGAATTCA AAAAGCTT 2998 

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4152 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

AAGCTTTTTG TGAAAACCCT AGGATATGTC CCCTCCCTCA CCACACCCAA CCCCCCGCCC 

c^gcccSgg acatgacgat GCCTCACACA cacacacaca cacacataca cacaaggccg 

TGAGCTGCAC GCAGGAACAT GGGCTGCACT CACGACAACA TTGAAAAAAT ATACATTATA 
TATrTACACC CGGGGCCCCC ACGTCCCCTC CCGTCCCCGC AGCCTGGCCA CACCAGGTCA 

S GCCGGGGCTG CAGGACCTCA GGACTGCAAG GGCAGGAAGG GAAACAGGAC 300 
AAriAirrAA GGAAGTTGGA AAGGAGGGAG AAATGGGGTC CCCAGACTGA AATGGAAATG 360 
AGGTGGGGCG — AAGCAGGGAC aATGGTCCAG = GAGC CCTGCAGAGG 42 

KSE £2SS£ iSSSS CTCCCCAACA 5 0 

S TAAGGGGGCA GCTGCCATTG GGGGTAGAAA GG^GCTGAA GTCCAGCCCA 600 
CTTTCCAACC CAGCCAGCCC CAGTGCAAGG GGCACACCAG GAGCATGACA GCCCAGAAGT 660 

gISgg Sgggg AGGGGCAGGG CGGACTCCAG agggcccgct ggggttttga 720 

AATGAAAGGA GGACTGGTTC TGAAGCCTCT CTCCCTCTTG GTCTCTGTGT TCCCAGAAAG 780 

TPrTTCTCCC ATGTCTGGAG TGTCTGTTTC ACCAGGGCAG AATTCCCCCT CTGCGTGGGG 840 

fccS CGGTGTGGGG GGGTCTCGAT GATGCGTCTC TCGTCGCTGC 900
TGGGGGAATC GGCCACCTCC GAGTCACTGC TGTCCTCATC CTCCTGCTGG CCCCCAACAG 960

CCCCCGTCAC ACAGGACTGC CGATTCTGGT AGGACTCCAT GGGGTTCACA ATGATGGTGA 1020

CAGCTGAGTC ATCCCAGAAG AGGTCTGGGT CCTTGGGGTC ACTGGAGGCC CCTGGAGGCC 1080
«CC^CCC CGGTGAAGGG AATGGATGCG CACCAGGCCC AGGACGACCA 1140

TTArCACCAG GAAGCCCACG CACACCACAA TGATGAGGGT TGCGGCGCTG GGTATCATGG 1200
l™?c "g gS£c? AGGCTGTGTC CAGCCATCTC AGGCGGGGGC TGGTGACCAC 1260

gItgcaggaa CTGCTGGGAG CTGAGCACGT GGCTGGGGTG GGCAACCCGG TTCATGCTGT 1320

GCAGGACATT GACCTCCACG ATGAATTCAT TGCTGGAGTA ACGGCCATTC ATTTCCGAGC 1380

S"CG HcTTCCTG GTGTAGAGGG CAGCTCCGTG TCACAGCCGA TAACGAGCCT 1440
rrrTrACnAT CTCTTCATAC ACAGTGATGC TCTCCACCCC AGCAATAGTG AGGTAGGCAfa

2££ SSSSS CCCCGCTGCT GCAGAGAGGT TGTGTCCAGG AGCAGGCTTT 1560

CCCGCTCGGG ATCCAGGTCA TCCCCCACCA GAGAAATTTC ACAGCCATCC AGGTTGTGCA 1620

CAATCTCATC CGACATGCGT GTGTCTGTCA CTGTGCCCTG CCAACTCTCA TCCTTTTTGG 1680

CCTCCACCTG GTGAGAAATG GAGCAGGTGA TTTGAAGATC AGGGAACAAA K«0WHT 1740

TrrTTTCCTC AAAGTCCACA GCTGGGCGGG CAAAATGAGC AGTGCCACTC AGCAGGATCT 1800

gSgmtc aIgSS ACGACCACGT AGCCCTCCAC TTCAGGGATG GAGACGCAGG 1860

ACTCTTCGCT GAAGCACTTG ACAGCAGTGG TGAGGCGCAG GGGCCTGACG KGGGCGTGG 1920
JaAAGmEaG AG?GTTCATG TAAGCCACAT GCTGCAGGGC ATGGTTGAAG OTCTCCACAT 1980 

«S S25SSS ™ T cc S ESSE S g. 

CCTCGCGGCT 3£S C^GAGCGCA CGCTGAAACC AGCCAGGTAG CCATGGAAGT |M0 
AGTGGTGGAT CGACAAAGGG TCTCCTTGGG TGGTGTCTGT «TGTTGTCT =«TTTTCCT 
TCTCTTTGTT CTTCTCCTCA GTCCAGCAGG CCCCAATCAT GAGAGCAGGC TCCCTTCGBb 
GTGGGTGGAT SgGCCATTG TCATGGATGA GGGCAGGGTC 
GCCCACTGGG GCCACCCAGG GGCACCTGCA CAGCCTGGGT GCCATTGAAC CAGTAGATCA 2700
GGCTGCTGTC CTGGCTGTAG TGCACCGAGA GTCCTGCTGT CCAGTTGGCA TTGGGGCCAG 2760
GCATGGGCAA CAGATCCACT TCCCCAGTGG CAGCACCACA GAGTTTCCGC AGCGCCCGCT 2820
CTGAGTAGTT GTCACGGTCA CAGCCCTTGG CCACATGGCT GGTCTGCAGC TCTATGGTGG 2880
CCTGAATGTT CCAGAGTGGT TCATCACAGG TCTCCAGGCG GATACCAGGG AACAAAGCCA 2940
AGCTCCCAGC ACCTGGTGCA TATTCGATCC TTTTGTTCCA GCCTTGCCAG CTGGGTTTAC 3000
AGGTGGGCTT CACCTGAATC TCCACCTCAG CATCATCTGC TGCCCGCTTC TTCCCACAGT 3060
CATAAGCTGT CACTGTAAAC TTATAGAGCC TCTCACCACT GTACTGCAGC TTCTCTGTGT 3120
TCTCAATGTT CCCGTCATTG TCAATGAGGA AAGGGGTGTT GGGTGTGAGA ATCTCATAGT 3180
AGCAGATCTG GCTGTACTGG GGGGAGCAGT CACCGTCAAT GGCTTCCACC CGCAGGATGC 3240
GATCGTACAG CTTCCCCTCT GTCACAGCCG CACGATACAG CCGTTCCACA AACACTGGGG 3300
CAAACTCGTT CACATCGTTG ACCCGCACAT GCACAGTGGC CTTGTGGGAC TTCTTGGTGT 3360
TGGCCCCGTC GGGGCCCTCG CCACAGTCAT AGGCCTGGAT GGTGAAGGTG TGTTCCTTCT 3420
GGGCCTCGCA GTCCACAGGC TCCTTGGCCC GGATCAGCCC CTCTCCTGTC GCCTTGTCAA 3480
GGATCACAGC CTCAAAGGGC ACCCCAGACC CATGGAGCCG GAAGCCGCAG ATCTCACCTG 3540
CATAGCGCAG CGGGGCATCC TTGTCCAAGG CAAAGAGTGG TGGATTCAGT AGGACCGTGT 3600
TGTCATTCTC CATGACGATG CCCTGGTACT CTGCCTCAAT CCATGGCTTG TGCTTGTTGG 3660
CTTTGTTACA GGAGCAGGAC GCGAGCAGAG AGGCCAGCAG AAGGGGCAGC AGCAGGAGGG 3720
TCATGGTGCG GCGTGGGGCA GGGCAGGGCC AGGCGTTTGC CTCCCCTGGG AGCCTCCAGC 3780
CTGCGGATTC CACCTTGCGG GAGGGATACA GGGGGGGAAA ACCAAAATAA AACGTCAAAT 3840
AAATTGTGTA GGAGGAGTCC AGCTTAGGAC CGGGCCAGAG CCAGGCCAGG CTCGGGGAGG 3900
GGGCCTCTGC AGGTTCAGAG GATCACTGCT GCCACCACCG CCACCCTGGG AGCCAGTTAT 3960
TTTGCCATGG CCTTGATTGC AACAGCTGCC TCCTCTGTCA TGGCAGACAG CACCGTGATC 4020
AGGATCTCTT CTCCACAGTC GTACTTCTGC TCAATCTCCT TGCCAAGGTC TCCCTCAGGG 4080
AGACGAAGGT CCTCTCGTAC CTCCCCGCTG TCCTGGAGCA GTGATAGGTA CCCATCCTGG 4140
ATCTTTGGAT CC 4152



(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3117 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:



GGATCCAAAG ATTCGGCACG AGTGGCCACA TCATGAACCT CCAGGCCCAG CCCAAGGCTC 60
AGAACAAGCG GAAGCGTTGC CTCTTTGGGG GCCAGGAACC AGCTCCCAAG GAGCAGCCCC 120
CTCCCCTGCA GCCCCCCCAG CAGTCCATCA GAGTGAAGGA GGAGCAGTAC CTCGGGCACG 180
AGGGTCCAGG AGGGGCAGTC TCCACCTCTC AGCCTGTGGA ACTGCCCCCT CCTAGCAGCC 240
TGGCCCTGCT GAACTCTGTG GTGTATGGGC CTGAGCGGAC CTCAGCAGCC ATGCTGTCCC 300
AGCAGGTGGC CTCAGTAAAG TGGCCCAACT CTGTGATGGC TCCAGGGCGG GGCCCGGAGC 360
GTGGAGGAGG TGGGGGTGTC AGTGACAGCA GCTGGCAGCA GCAGCCAGGC CAGCCTCCAC 420
CCCATTCAAC ATGGAACTGC CACAGTCTGT CCCTCTACAG TGCAACCAAG GGGAGCCCGC 480
ATCCTGGAGT GGGAGTCCCG ACTTACTATA ACCACCCTGA GGCACTGAAG CGGGAGAAAG 540
CGGGGGGCCC ACAGCTGGAC CGCTATGTGC GACCAATGAT GCCACAGAAG GTGCAGCTGG 600
AGGTAGGGCG GCCCCAGGCA CCCCTGAATT CTTTCCACGC AGCCAAGAAA CCCCCAAACC 660
AGTCACTGCC CCTGCAACCC TTCCAGCTGG CATTCGGCCA CCAGGTGAAC CGGCAGGTCT 720
TCCGGCAGGG CCCACCGCCC CCAAACCCGG TGGCTGCCTT CCCTCCACAG AAGCAGCAGC 780
AGCAGCAGCA ACCACAGCAG CAGCAGCAGC AGCAGCAGGC AGCCCTACCC CAGATGCCGC 840
TCTTTGAGAA CTTCTATTCC ATGCCACAGC AACCCTCGCA GCAACCCCAG GACTTTGGCC 900
TGCAGCCAGC TGGGCCACTG GGACAGTCCC ACCTGGCTCA CCACAGCATG GCACCCTACC 960
CCTTCCCCCC CAACCCAGAT ATGAACCCAG AACTGCGCAA GGCCCTTCTG CAGGACTCAG 1020
CCCCGCAGCC AGCGCTACCT CAGGTCCAGA TCCCCTTCCC CCGCCGCTCC CGCCGCCTCT 1080
CTAAGGAGGG TATCCTGCCT CCCAGCGCCC TGGATGGGGC TGGCACCCAG CCTGGGCAGG 1140
AGGCCACTGG CAACCTGTTC CTACATCACT GGCCCCTGCA GCAGCCGCCA CCTGGCTCCC 1200
TGGGGCAGCC CCATCCTGAA GCTCTGGGAT TCCCGCTGGA GCTGAGGGAG TCGCAGCTAC 1260
TGCCTGATGG GGAGAGACTA GCACCCAATG GCCGGGAGCG AGAGGCTCCT GCCATGGGCA 1320
GCGAGGAGGG CATGAGGGCA GTGAGCACAG GGGACTGTGG GCAGGTGCTA CGGGGCGGAG 1380
TGATCCAGAG CACGCGACGG AGGCGCCGGG CATCCCAGGA GGCCAATTTG CTGACCCTGG 1440
CCCAGAAGGC TGTGGAGCTG GCCTCACTGC AGAATGCAAA GGATGGCAGT GGTTCTGAAG 1500
AGAAGCGGAA AAGTGTATTG GCCTCAACTA CCAAGTGTGG GGTGGAGTTT TCTGAGCCTT 1560
CCTTAGCCAC CAAGCGAGCA CGAGAAGACA GTGGGATGGT ACCCCTCATC ATCCCAGTGT 1620
CTGTGCCTGT GCGAACTGTG GACCCAACTG AGGCAGCCCA GGCTGGAGGT CTTGATGAGG 1680
ACGGGAAGGG TCTTGAACAG AACCCTGCTG AGCACAAGCC ATCAGTCATC GTCACCCGCA 1740
GGCGGTCCAC CCGAATCCCC GGGACAGATG CTCAAGCTCA GGCGGAGGAC ATGAATGTCA 1800
AGTTGGAGGG GGAGCCTTCC GTGCGGAAAC CAAAGCAGCG GCCCAGGCCC GAGCCCCTCA 1860
TCATCCCCAC CAAGGCGGGC ACTTTCATCG CCCCTCCCGT CTACTCCAAC ATCACCCCAT 1920
ACCAGAGCCA CCTGCGCTCT CCCGTGCGCC TAGCTGACCA CCCCTCTGAG CGGAGCTTTG 1980
AGCTACCTCC CTACACGCCG CCCCCCATCC TCAGCCCTGT GCGGGAAGGC TCTGGCCTCT 2040
ACTTCAATGC CATCATATCA ACCAGCACCA TCCCTGCCCC TCCTCCCATC ACGCCTAAGA 2100

GTGCCCATCG CACGCTGCTC CGGACTAACA GTGCTGAAGT AACCCCGCCT GTCCTCTCTG 2160 

TGATGGGGGA G^CCACCCCA GTGAGCATCG AGCCACGGAT CAACGTGGGC TCCCMTTCC 2220 

IrrrARAAAT CCCCTTGATG AGGGACCGTG CCCTGGCAGC TGCAGATCCC CACAAGGCTG 2280 

ACTTGGTGTG GCAGCCATGG GAGGACCTAG AGAGCAGCCG GGAGAAGCAG AGGCAAGTGG 2340 

AAGACCTGCT GACAGCCGCC TGCTCCAGCA TTTTCCCTGG TGCTGGCACC AACCAGGAGC 2400 

tcmcctgca CTGTCTGCAC GAATCCAGAG GAGACATCCT GGAAACGCTG AATAAGCTGC 2460 

TGCTGAAGAA GCCCCTGCGG CCCCACAACC ATCCGCTGGC AACTTATCAC TACACAGGCT 2520 

J?S?CAGTG GAAGATGGCC GAGAGGAAGC TGTTCAACAA AGGCATTGCC ATCTACAAGA 2580 

S2S55S? CCTGGTGCAG AAGCTGATCC AGACCAAGAC CGTGGCCCAG TGCGTGGAGT 2640 

TCTACTACAC CTACMGAAG CAGGTGAAAA TCGGCCGCAA TGGGACTCTA ACCTTTGGGG 2700 

IStMWAC GAGCGATGAG AAGTCGGCCC AGGAAGAGGT TGAAGTGGAT ATTAAGACTT 2760 

ScAA^GU CCCAAGGGTG CCTCTTCCCA GAAGAGAGTC CCCAAGTGAA GAGAGGCTGG 2820 

AGCCCAAGAG gSgGTGAAG GAGCCCAGGA AGGAGGGG6A GGAGGAGGTG CCAGAGATCC 2880 

AAGAGAAGGA GGAGCAGGAA GAGGGGCGAG AGCGCAGCAG GCGGGCAGCG GCAGTCAAAG 2940 

CCACGCAGAC ACTACAGGCC AATGAGTCGG CCAGTGACAT CCTCATCCTC CGGAGCCACG 3000 

AGKC^C SGTCT GCCGGTGGCC AGGCCTCGGA GAAGCCAAGG JAAGGGACAG 3060 

GGAAGTCACG AAGGGCACTA CCTTTTTCAG AAAAAAAAAA AAAAAAACAA AAAGCTT 3117 

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3306 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

GAATTCGGCA CGAGGTCAGT TTCCTGTGGA ACACAGAGGC TGCCTGTCCC ATTCAGACAA 60
CGACGGATAC AGACCAGGCT TGCTCTATAA GGGATCCCAA CAGTGGATTT GTGTTTAATC 120
5™?"gct AAACAGTTCG CAAGGATATA ACGTCTCTGG CATTGGGAAG ATTTTTATGT 180
TTAATGTCTG CGGCACAATG CCTGTCTGTG GGACCATCCT GGGAAAACCT GCTTCTGGCT 240
™S mSc? GAAGAGCTCA AGAATTGGAA GCCAGCAAGG CCAGTCGGAA 300
TTGAGAAAAG CCTCCAGCTG TCCACAGAGG GCTTCATCAC TCTGACCTAC AMGGGCCTC 360
TCTCTGCCAA AGGTACCGCT GATGCTTTTA TCGTCCGCTT TGTTTGCAAT GATGATGTTT 420
IScIgggcc CCTCAAATTC CTGCATCAAG ATATCGACTC TGGGCAAGGG ATCCGAAACA 480
PTTACTTTGA GTTTGAAACC GCGTTGGCCT GTGTTCCTTC TCCAGTGGAC TGCCAAGTCA 540
c C ^£t2£ ?Igaaatgag TACGACCTGA CTGGCCTAAG CACAGTCAGG AAACCTTGGA 600
WCCTCTGTC GATGGGAGAA AGAGGACTTT CTATTTGAGC GTTTGCAATC 660
■SSJSEJ c^cSgga TGCCAGGGCA GCGCAGTGGG GTCTTGCTTA GTGTCAGAAG 720
gcS?agct2 G^ATCTGGGT GTGGTGCAGA TGAGTCCCCA AGCCGCGGCG AATGGATCTT 780
Tr&GCATCAT GTATGTCAAC GGTGACAAGT GTGGGAACCA GCGCTTCTCC ACCAGGATCA 840
^5™£g ?g^cagata TCGGGCTCAC CAGCATTTCA GCTTCAGGAT GGTTGTGAGT 900
ACGTGTTTAT "gWGAACT GTGGAAGCCT GTCCCGTTGT CAGAGTGGAA GGGGACAACT 960
"Sggtga^ 5g5S CATGGCAACT TGTATGACCT GAAGCCCCTG GGCCTCAACG 1020
AcfcWTC^ GAGCGCTGGC GAATACACTT ATTACTTCCG GGTCTGTGGG AAGCTTTCCT 1080
PARACRTCTG CCCCACAAGT GACAAGTCCA AGGTGGTCTC CTCATGTCAG GAAAAGCGGG 1140
^SgSggg WTTCACAAA GTGGCAGGTC TCCTGACTCA GAAGCTAACT TATGAAAATG 1200
£nG"2£ A^CHC ACGGGGGGGG ACACTTGCCA TAAGGTTTAT CAGCGCTCCA 1260
rarrrATCTT CTTCTACTGT GACCGCGGCA CCCAGCGGCC AGTATTTCTA AAGGAGACTT 1320
^T«5tC CTACTTGTTT GAGTGGCGAA CGCAGTATGC CTGCCCACCT TTCGATCTGA 1380
C^G^C ATTCAAAGAT GGGGCTGGCA ACTCCTTCGA CCTCTCGTCC CTGTCAAGGT 1440
S S CTGG^C ATCACTGGGA CGGGGGACCC GGAGCACTAC CTCATCAATG 1500
TCTGCAAGTC TCTGGCCCCG CAGGCTGGCA CTGAGCCGTG CCCTCCAGAA GCAGCCGCGT 1560
GTCTGCTGGG TGGCTCCAAG CCCGTGAACC TCGGCAGGGT AAGGGACGGA CCTCAGTGGA 1620
GAGATGGCAT AATTGTCCTG AAATACGTTG ATGGCGACTT ATGTCCAGAT GGGATTCGGA 1680
**A&rTPAAr TArrATCCGA TTCACCTGCA GCGAGAGCCA AGTGAACTCC AGGCCCATGT 1740
%SSS£ CG^GAG^ SaCA CCTTTGCCTG GCCCACAGCC ACAGCCTGTC 1800
CCATGAAGAG CAACGAGCAT GATGACTGCC AGGTCACCAA CCCAAGCACA GGACACCTGT 1860
TTGATCTGAG CTCCTTAAGT GGCAGGGCGG GATTCACAGC TGCTTACAGC GA«AGGGGT 1920
TGGTTTACAT GAGCATCTGT GGGGAGAATG AAAACTGCCC TCCTGGCGTG GGGGCCTGCT 1980
TTGGACAGAC CAGGATTAGC GTGGGCAAGG CCAACAAGAG GCTGAGATAC GTGGACCAbb 2040
GGTGTACAAG GATGGGTCCC CTTGTCCCTC JAAATCCGGC CTGAKTATA 2100
ar«rtrT(!AT CAGTTTCGTG TGCAGGCCTG AGGCCGGGCC AACCAATAGG CCCATGCTCA 2160
S« SSSS TGCACTCTCT TCTTCTCCTG GCACACGCCG CTGGCCT G 2220
AGCAAGCGAC CGAATGTTCC GTGAGGAATG GAAGCTCTAT TGTTGACTTG JCTCCCCTTA 2280
TTCATCGCAC TGGTGGTTAT GAGGCTTATG ATGAGAGTGA GGATGATGCC TCCGATACCA 2340
ESrSSE CTACATCAAT ATTTGTCAGC CACTAAATCC CATGCACGGA GTGCCCTGTC 2400
CTGCCGGAGC CGCTGTGTGC AAAGTTCCTA TTGATGGTCC CCCCATAGAT ATCGGCCGGG 2460
TAGCAGGACC ACCAATACTC AATCCAATAG CAAATGAGAT TTACTTGAAT TTTGAAAGCA 2520



GTACTCCTTG CCAGGAATTC AGTTGTAAAT AAAATTGAAC CTGCTCAACA GCTGAGGGAG 2580 

ACTAGAAATG ATGGGTCCAT ATCCTGGTGC ATTGTCATAC AATTCAAACA ATGGTGCAGC 2640 

TACCAGCTTG TAATTTTTAG GGACTGCAAA CAAGGCTTTT TCTTGAAGCT GAACCAGAAA 2700 

CAACTTCTTA TGTTCCTTAG GCTTTGTAAT ATGTGCAGGA ATATATGGAT ACTGAGGAGG 2760 

TTCAAAATTT GGTCTCCACC AGTTACCAAT GCAATCGTCA ATGACCCAGT CTTGCAAAAC 2820 

TCCATCCTGA CGACCCAGTA TCTCTGTCAT TAAGCGTTTT AGTCCTTCAA CTTCATCTTC 2880 

TCCTGGGTTA AGTTCACCAC CAGGTAGTTT GAAGAAAGTT GTTCCCAGCT GCAGCAGTAA 2940 

CACATGGGGT AGCCGGTGCT CATGTACAAT CAGAACCCCT TCTACAGTCC TCCTCATTCC 3000 

AATTTTATCA AATTCTTCCC TCATGCGCTG AAATCTGGCT GCAACAGAGC TGTCCTTCTC 3060 

GTAGAGGGGC TCTTTTGTAC CAAAAGTATA ATTGGTAAGA GGGTACAGGT TGATGGTGCG 3120 

CTCCAGGGTG AGGGGCTTCG TCTGCTGGAT GTACTTGTTG CCGAACTGAG TGACCCCCCG 3180 

GGGCCAGCCG GTCTGCGAGC GATTGGGCGG TACCACAGAC ATGCTGGCGA GCTCCGGCGC 3240 

TGACGGCGAG CAGAAAGTGG CAGGCAGGGT AGACTTTCCC CGTGCGGGAA GCCTCGTGCC 3300 
GAATTC 



3306 



(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4218 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

GAATTCGGCA CGAGAATGGA TCAACCTCAA CAACACGTTA AAGCTAGACG AAAGAAGTAA 60 

TACACAGTGT ATGAGTCTCA CATGAAATAC CCGGATGTAA ATCCAAAGAA ACAGGAAGCA 120

GATTGGTGGT TGCCAGGGAC AAGGGCGGTG GGAGGAGAAA ATGGAGAGTA ACGGGACTTT 180 

ACTTTTGGAG TGATGAGAAT GTTTTGGAGC TAGATAGAAG TGGTGGTTGT ACACCATTGT 240 

GGATGTACTA CCACTTAATT GTTCACTTAA AAAGTTAATT TATGTGAATT GCATCTTAAT 300 

TAAAAACAAG GATAACATTC CAACTCCTGG ACATTATCCT TCCTTTCCAT TTGATGTCAG 360 

GCCCGTGTTA GAATTCTCAT CCGGTTTGGT CACTGCACTT AAGATGTGGA GAAATTAGGA 420 

CGCACAGTTA AGAGGAAGGA TAACACTGAT TAAGGTAGTG CTTTTCTAGG TTTCCCCTAA 480 

ACAATTTAAC AGATGGATAG TGGCACCACT TACGAGATGG AAAAACCAGC GGAAGGAAGA 540 

TTTGGGGGAG AAGTTAAGTT TGTCTTGGGC CTGTGTTTTG CAACCTGAGT GTAAAAGACA 600 

TATGTTAAGT CTTCAGTGGC GAAACACTAA AACTAGAAAT GGATCAGAAT TTTATCTTTG 660 

GATGTGACTT CTCAAGGATG GTCTTGTCAC TTCAGTGCCT GGTCAAATGA CAAGATGGGC 720

AATCTTTTCC TGAAGGTCCA AGCACCTGAA CGTGGCAGGG TGACCCGATT CCGATTTGCT 780 

TAGAACAATC CTAGTTCATG CCTATTGTCC CTCATGTAAT TAATATCACT CTCAAAATGT 840 

CTCATTTTGT GCAATAAATT CTGCAACGTG ATGGCGCGAC TCTCGCGGCC CGAGCGGCCG 900 

GACCTTGTCT TCGAGGAAGA GGACCTCCCC TATGAGGAGG AAATCATGCG GAACCAATTC 960 

TCTGTCAAAT GCTGGCTTCA CTACATCGAG TTCAAACAGG GCGCCCCGAA GCCCAGGCTC 1020 

AATCAGCTAT ACGAGCGGGC ACTCAAGCTG CTGCCCTGCA GCTACAAACT CTGGTACCGA 1080 

TACCTGAAGG CGCGTCGGGC ACAGGTGAAG CATCGCTGTG TGACCGACCC TGCCTATGAA 1140 

GATGTCAACA ACTGTCATGA GAGGGCCTTT GTGTTCATGC ACAAGATGCC TCGTCTGTGG 1200 

CTAGATTACT GCCAGTTCCT CATGGACCAG GGGCGCGTCA CACACACCCG CCGCACCTTC 1260 

GACCGTGCCC TCCGGGCACT GCCCATCACG CAGCACTCTC GAATTTGGCC CCTGTATCTG 1320 

CGCTTCCTGC GCTCACACCC ACTGCCTGAG ACAGCTGTGC GAGGCTATCG GCGCTTCCTC 1380 

AAGCTGAGTC CTGAGAGTGC AGAGGAGTAC ATTGAGTACC TCAAGTCAAG TGACCGGCTG 1440 

GATGAGGCCG CCCAGCGCCT GGCCACCGTG GTGAACGACG AGCGTTTCGT GTCTAAGGCC 1500 

GGCAAGTCCA ACTACCAGCT GTGGCACGAG CTGTGCGACC TCATCTCCCA GAATCCGGAC 1560 

AAGGTACAGT CCCTCAATGT GGACGCCATC ATCCGCGGGG GCCTCACCCG CTTCACCGAC 1620

CAGCTGGGCA AGCTCTGGTG TTCTCTCGCC GACTACTACA TCCGCAGCGG CCATTTCGAG 1680 

AAGGCTCGGG ACGTGTACGA GGAGGCCATC CGGACAGTGA TGACCGTGCG GGACTTCACA 1740 

CAGGTGTTTG ACAGCTACGC CCAGTTCGAG GAGAGCATGA TCGCTGCAAA GATGGAGACC 1800 

GCCTCGGAGC TGGGGCGCGA GGAGGAGGAT GATGTGGACC TGGAGCTGCG CCTGGCCCGC 1860 

TTCGAGCAGC TCATCAGCCG GCGGCCCCTG CTGCTCAACA GCGTCTTGCT GCGCCAAAAC 1920 

CCACACCACG TGCACGAGTG GCACAAGCGT GTCGCCCTGC ACCAGGGCCG CCCCCGGGAG 1980 

ATCATCAACA CCTACACAGA GGCTGTGCAG ACGGTGGACC CCTTCAAGGC CACAGGCAAG 2040 

CCCCACACTC TGTGGGTGGC GTTTGCCAAG TTTTATGAGG ACAACGGACA GCTGGACGAT 2100 

GCCCGTGTCA TCCTGGAGAA GGCCACCAAG GTGAACTTCA AGCAGGTGGA TGACCTGGCA 2160 

AGCGTGTGGT GTCAGTGCGG AGAGCTGGAG CTCCGACACG AGAACTACGA TGAGGCCTTG 2220 

CGGCTGCTGC GAAAGGCCAC GGCGCTGCCT GCCCGCCGGG CCGAGTACTT TGATGGTTCA 2280 

GAGCCCGTGC AGAACCGCGT GTACAAGTCA CTGAAGGTCT GGTCCATGCT CGCCGACCTG 2340 

GAGGAGAGCC TCGGCACCTT CCAGTCCACC AAGGCCGTGT ACGACCGCAT CCTGGACCTG 2400 

CGTATCGCAA CACCCCAGAT CGTCATCAAC TATGCCATGT TCCTGGAGGA GCACAAGTAC 2460 

TTCGAGGAGA GCTTCAAGGC GTACGAGCGC GGCATCTCGC TGTTCAAGTG GCCCAACGTG 2520 

TCCGACATCT GGAGCACCTA CCTGACCAAA TTCATTGCCC GCTATGGGGG CCGCAAGCTG 2580 

GAGCGGGCAC GGGACCTGTT TGAACAGGCT CTGGACGGCT GCCCCCCAAA ATATGCCAAG 2640 

ACCTTGTACC TGCTGTACGC ACAGCTGGAG GAGGAGTGGG GCCTGGCCCG GCATGCCATG 2700 

GCCGTGTACG AGCGTGCCAC CAGGGCCGTG GAGCCCGCCC AGCAGTATGA CATGTTCAAC 2760 
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ATCTACATCA 
AAGGCCATTG 
ATGGAGTGCA 
ATCTGTGACC 
CATGGCAATG 
TACAACACGC 
GGCACCGTGT 
CAGCGGGCAG 
AAGATCCTGT 
GTCAACCCCG 
CCCAACGAGG 
GAAGACTGAC 
TTGCCTCAAA 
GTTTGAAGTG 
CTACATCCCT 
CCAGGGCATG 
GCCCGTGTCC 
GTCGGACCTA 
CCTGACTGGG 
ATGTTGAACA 
CCTGCCTGCT 
GTTTTCACCC 
TGCTCCTGCC 
GCAGCAGGTG 
TGCCGTATTG 



AGCGGGCGGC 
AGGTGCTGTC 
AGCTCGGGGA 
CCCGGACGAC 
AGGACACCAT 
AGGTCAACTT 
CTGACCTGGC 
AGCAGCTGGC 
TCGTGAGGAG 
AG GA GAT CCA 
TTCGGCTGGA 
CCGTCCCCTC 
GGTTTTCCCT 
TGGGCATTGG 
GAAAGGAAGT 
GGGTGCTGTG 
CTCTGTCCTC 
GAAAGAGCAG 
ATCCAGGCCT 
CAGCACTGGC 
TTCGTGGGTT 
TGGTTGGTGT 
CTGCACACAG 
GCATGGAGCC 
AGAGGGCT 



CGAGATCTAT 
GGACGAGCAC 
GATTGACCGC 
CGGCGCGTTC 
CAAGGAAATG 
CATGGCCTCG 
CCCTGGGCAG 
GGCTGAGGCG 
TGACGCCTCC 
GCTGGGCGAG 
GCAGCAGAGC 
GTGCCGAATT 
CGAAGTCACA 
TGGCCTGTGG 
TGAGCCTGCT 
AGGGCAGCAT 
TAGCCCACTA 
ACAATCTCTC 
TCTCCCCTGC 
CAAAGAGATG 
CAGAGTAGCT 
GCTTTGCTTT 
CGGAGCGACT 
AAGAGGAAGT 



GGGGTCACCC 
GCGCGTGAGA 
GCCCGGGCCA 
TGGCAGACGT 
CTGCGTATCC 
CAGATGCTCA 
AGTGGCATGG 
GAGCGTGACC 
CGGGAGGAGC 
GACGAGGACG 
GTGCCAGCCG 
CGGCACGAGC 
AATGTTTCAA 
CTGTGTCCTC 
CCTCCATCCG 
GCTAGGTGTG 
AGGCCCTGGC 
TGGGTCACCA 
CCATGTGAAT 
TCACCGTGGG 
GAGGCTTGTC 
GAGGGCACTT 
TTTCTAGGTA 
GTGACTGAAA 



ACACCCGCGG 
TGTGCCTGCG 
TCTACAGCTT 
GGAAGGACTT 
GGCGCAGCGT 
AGGTCTCGGG 
ACGACATGAA 
AGCCCTTGCG 
TGGCAGAGCT 
AGGACGAGAT 
CAGTGTTTGG 
AAGACCAGCC 
GGAATCTCAA 
TCTCTGTAGC 
CAGACCTCCC 
ACCGTGCTCC 
CCATTTGTGC 
GTCTGGCTAG 
TCCCAGGGGC 
AACCGAGGCT 
TGAGAGGAGT 
AGAAAGCCCA 
TGCTCTTGAT 
CTGTCCACTC 



CATCTACCAG 
GTTTGCAGAC 
CTGCTCCCAG 
TGAGGTCCGG 
GCAGGCCACG 
CAGTGCCACG 
GCTGCTGGAA 
CGCCCAGAGC 
GGCACAGCAG 
GGACCTGGAG 
GAGCCTGAAG 
CCCAGATCAT 
ATTTTACAAA 
TGTTTTCTCC 
TTTCCAGCGC 
TGGCCTCCAG 
TAAACAGGCA 
GAGCTGGTCT 
AGAGCCTGAA 
CTCTTCTCCT 
TGGAGTGCTG 
GCCCAGCCCT 
TTCTGCAGAA 
ATAGCCCGGC 



(2) INFORMATION FOR SEQ ID N0:9: 

(l) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 1187 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
CD) TOPOLOGY: linear 

Cii) MOLECULE TYPE: Genomic DHA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 



GAGCTCGCGC GCCTGCAGGT CGACACTAGT GGATCCAAAG AATTCGGCAC GAGGGAAACT 60
CAACGGTGTA CGAGTGGAGG ACAGGGACAG AGCCCTCTGT GGTGGAACGA CCCCACCTCG 120
AGGAGCTTCC TGAGCAGGTG GCAGAAGATG CGATTGACTG GGGCGACTTT GGGGTAGAGG 180
CAGTGTCTGA GGGGACTGAC TCTGGCATCT CTGCCGAGGC TGCTGGAATC GACTGGGGCA 240
TCTTCCCGGA ATCAGATTCA AAGGATCCTG GAGGTGATGG GATAGACTGG GGAGACGATG 300
CTGTTGCTTT GCAGATCACA GTGCTGGAAG CAGGAACCCA GGCTCCAGAA GGTGTTGCCA 360
GGGGCCCAGA TGCCCTGACA CTGCTTGAAT ACACTGAGAC CCGGAATCAG TTCCTTGATG 420
AGCTCATGGA GCTTGAGATC TTCTTAGCCC AGAGAGCAGT GGAGTTGAGT GAGGAGGCAG 480
ATGTCCTGTC TGTGAGCCAG TTCCAGCTGG CTCCAGCCAT CCTGCAGGGC CAGACCAAAG 540
AGAAGATGGT TACCATGGTG TCAGTGCTGG AGGATCTGAT TGGCAAGCTT ACCAGTCTTC 600
AGCTGCAACA CCTGTTTATG ATCCTGGCCT CACCAAGGTA TGTGGACCGA GTGACTGAAT 660
TCCTCCAGCA AAAGCTGAAG CAGTCCCAGC TGCTGGCTTT GAAGAAAGAG CTGATGGTGC 720
AGAAGCAGCA GGAGGCACTT GAGGAGCAGG CGGCTCTGGA GCCTAAGCTG GACCTGCTAC 780
TGGAGAAGAC CAAGGAGCTG CAGAAGCTGA TTGAAGCT6A CATCTCCAAG AGGTACAGCG 840
GGCGCCCTGT GAACCTGATG GGAACCTCTC TGTGACACCC TCCGTGTTCT TGCCTGCCCA 900
TCTTCTCCGC TTTTGGGATG AAGATGATAG CCAGGGCTGT TGTTTTGGGG CCCTTCAAGG 960
CAAAAGACCA GGCTGACTGG AAGATGGAAA GCCACAGGAA GGAAGCGGCA CCTGATGGTG 1020
ATCTTGGCAC TCTCCATGTT CTCTACAAGA AGCTGTGGTG ATTGGCCCTG TGGTCTATCA 1080
GGCGAAAACC ACAGATTCTC CTTCTAGTTA GTATAGCGCA AAAAGCTTCT CGAGAGTACT 1140
TCTAGAGCGG CCGCGGGCCC ATCGATTTTC CACCCGGGTG GGGTACC 1187



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4218 



(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3306 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

CCCTCACTAA AGGGAACAAA AGCTGGAGCT CGCGCGCCTG CAGGTCGACA CTAGTGGATC 60
GAAAGTTCGT TACGCCAAGC TCGAAATTAA CTCTGGGCTG ACCCATAAAC ATTTGTCTGA 120
TCTAGGATAT AGTTGCGTTT CTTGCGGGCA GCAATCTGGA TGAGGCGGTT GAGGCACTGG 180
GTGGCCTGCT GGATCAGGAC ATCCCAGCGG CCAGCATAGT TCCGCTGCCG GCGTAGGCCC 240
ATCACCCGCA TCTTATCCAT GATGGCATTG GTACCCAGGA TGTTGTACTT CTTGGAAGGG 300
TTGGAGGCTG CATGTTTGAT GGCCCATGTG GTCTTGCCAG CAGCAGGCAG GCCCACCATC 360
ATCAGAATCT CACATTCTGC CTTGCTCTTT GGTCCAACGG TGCCCCGGAT ACGCTCACTA 420
AGGGGAAGGT GCTGGATGAA GGTAAACCCC GGGAGGACAG AACAGTAGGG CTCTGCTCTC 480
TGTCCGAAGT TGAACTCCAC TGCGCAATTC TTCACCAGGA CATGAGGATA GAGGGCCTGA 540
CCCCCCAAGG CTTCCTTCTG GATTCGGAAA GCAATGCCCA TCCACTTTCC ATTCTTGGTA 600
AAAGACAGTT CCACGTCATT TCCACATTCA AAATCCGCAA AGCAGCCAAT CACCGGAGAG 660
CTCTGCGGTG CTAGGAGAGC GGCTGGGCCC GCAGACTGGG GGGAAAGCTC CGCAGCCGCA 720
GTGGGCCCCA GGATCAGGCC CCGCGTGGCC TGGAGAAGCC CAGTCTGGGC TGGAGCGGGA 780
GCTGGACAGT GTGGCCTTGC GTTCGCCCCC GGGAGCGCTG CGAGTGTCGC GGCCTCGGGT 840
GGATTTGCTG AGCACCAATA CCTCACGGTT GCCAACCTGG GGTTTTAGCT CCCTTGGTTT 900
TAATCCCCTA GGGGCGGGTG GGGGCACGGG AGGAAGGATG GGCCAGCTGG GTGCAATCCT 960
GCTGTAAGCC AGCCATTCCT TGATTTCTTA GAATTAACTA AACGGTCGCG CCGGAGGCCG 1020
CGGGGGCCGG AGCGGAGCAG CCGCGGCTGA GGTTCCCGAG TCGGCCGCTC GGGGCTGCGC 1080
TCCGCCGCCG GGACCCCGGC CTCTGGCCGC GCCGGCTCCG GCCTCCGGGG GGGCCGGGGC 1140
CGCCGGGACA TGGTGCCAGT CGCACCCCTT CCCCGCCGCC GCTGAGCTCG CCGGCCGCGC 1200
CCGGGCTGGG ACGTCCGAGC GGGAAGATGT TTTCCGCCCT GAAGAAGCTG GTGGGGTCGG 1260
ACCAGGCCCC GGGCCGGGAC AAGAACATCC CCGCCGGGCT GCAGTCCATG AACCAGGCGT 1320
TGCAGAGGCG CTTCGCCAAG GGGGTGCAGT ACAACATGAA GATAGTGATC CGGGGAGACA 1380
GGAACACGGG CAAGACAGCG CTGTGGCACC GCCTGCAGGG CCGGCCGTTC GTGGAGGAGT 1440
ACATCCCCAC ACAGGAGATC CAGGTCACCA GCATCCACTG GAGCTACAAG ACCACGGATG 1500
ACATCGTGAA GGTTGAAGTC TGGGATGTAG TAGACAAAGG AAAATGCAAA AAGCGAGGCG 1560
ACGGCTTAAA GATGGAGAAC GACCCCCAGG AGNCGGAGTC TGAAATGGCC CTGGATGCTG 1620
AGTTCCTGGA CGTGTACAAG AACTGCAACG GGGTGGTCAT GATGTTCGAC ATTACCAAGC 1680
AGTGGACCTT CAATTACATT CTCCGGGAGC TTCCAAAAGT GCCCACCCAC GTGCCAGTGT 1740
GCGTGCTGGG GAACTACCGG GACATGGGCG AGCACCGAGT CATCCTGCCG GACGACGTGC 1800
GTGACTTCAT CGACAACCTG GACAGACCTC CAGGTTCCTC CTACTTCCGC TATGCTGAGT 1860
CTTCCATGAA GAACAGCTTC GGCCTAAAGT ACCTTCATAA GTTCTTCAAT ATCCCATTTT 1920
TGCAGCTTCA GAGGGAGACG CTGTTGCGGC AGCTGGAGAC GAACCAGCTG GACATGGACG 1980
CCACGCTGGA GGAGCTGTCG GTGCAGCAGG AGACGGAGGA CCAGAACTAC GGCATCTTCC 2040
TGGAGATGAT GGAGGCTCGC AGCCGTGGCC ATGCGTCCCC ACTGGCGGCC AACGGGCAGA 2100
GCCCATCCCC GGGCTCCCAG TCACCAGTCC TGCCTGCACC CGCTGTGTCC ACGGGGAGCT 2160
CCAGCCCCGG CACACCCCAG CCCGCCCCAC AGCTGCCCCT CAATGCTGCC CCACCATCCT 2220
CTGTGCCCCC TGTACCACCC TCAGAGGCCC TGCCCCCACC TGCGTGCCCC TCAGCCCCCG 2280
CCCCACGGCG CAGCATCATC TCTAGGCTGT TTGGGACGTC ACCTGCCACC GAGGCAGCCC 2340
CTCCACCTCC AGAGCCAGTC CCGGCCGCAC AGGGCCCAGC AACGGTCCAG AGTGTGGAGG 2400
ACTTTGTTCC TGACGACCGC CTGGACCGCA GCTTCCTGGA AGACACAACC CCCGCCAGGG 2460
ACGAGAAGAA GGTGGGGGCC AAGGCTGCCC AGCAGGACAG TGACAGTGAT GGGGAGGCCC 2520
TGGGCGGCAA CCCGATGGTG GCAGGGTTCC AG6ACGATGT GGACCTCGAA GACCAGCCAC 2580
GTGGGAG7CC CCCGCTGCCT GCAGGCCCCG TCCCCAGTCA AGACATCACT CTTTCGAGTG 2640
AGGAGGAAGC AGAAGTGGCA GCTCCCACAA AAGGCCCTGC CCCAGCTCCC CAGCAGTGCT 2700
CAGAGCCAGA GACCAAGTGG TCCTCCATAC CAGCTTCGAA GCCACGGAGG GGGACAGCTC 2760
CCACGAGGAC CGCAGCACCC CCCTGGCCAG GCGGTGTCTC TGTTCGCACA GGTCCGGAGA 2820
AGCGCAGCAG CACCAGGCCC CCTGCTGAGA TGGAGCCGGG GAAGGGTGAG CAGGCCTCCT 2880
CGTCGGAGAG TGACCCCGAG GGACCCATTG CTGCACAAAT GCTGTCCTTC GTCATGGATG 2940
ACCCCGACTT TGAGAGCGAG GGATCAGACA CACAGCGCAG GGCGGATGAC TTTCCCGTGC 3000
GAGATGACCC CTCCGATGTG ACTGACGAGG ATGAGGGCCC TGCCGAGCCG CCCCCACCCC 3060
CCAAGCTCCC TCTCCCCGCC TTCAGACTGA AGAATGACTC GGACCTCTTC GGGCTGGGGC 3120
TGGAGGAGGC CGGACCCAAG GAGAGCAGTG AGGAAGGTAA GGAGGGCAAA ACCCCCTCTA 3180
AGGAGAAGAA AAAAAAAACA AAAAGCTTCT CGAGAGTACT TCTAGAGCGG CCGCGGGCCC 3240
ATCGATTTTC CACCCGGGTG GGGTACCAGG TAAGTGTACC CAATTCGCCC TATAGTGAGT 3300
CGTATT 3306



(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
TGCGGGGCCA GAGTGGGCTG 

(2) INFORMATION FOR SEQ ID NO:12:



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GCAGTCCTGG CCTGCGGATG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GTCGACAGGA 6AATTGGTTC 

(2) INFORMATION FOR SEQ ID NO:14:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

GCCTGGGTTC GGTGCGGGAC 

(2) INFORMATION FOR SEQ ID NO:15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

TGGTCGGGTG TTTGTGAGTG 

(2) INFORMATION FOR SEQ ID NO:16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
CCTCTTCCGT CTCCTCAGTG 

(2) INFORMATION FOR SEQ ID NO:17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGATTGCTAG TCTCACAGAC 



(2) INFORMATION FOR SEQ ID NO:18:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTAAGGGTGG CTGAAGGGAC 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
ACCTTCCCTC CCTGTCACAG 

(2) INFORMATION FOR SEQ ID NO:20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:

TGGTCGGGTG TTTGTGAGTG 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:
ACACCATTCC AGAAATTCAG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:
AAACTGCAGG TGGCTGAGTC 

(2) INFORMATION FOR SEQ ID NO:23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
GTCCTAATGT TTTCAGGGAG 

(2) INFORMATION FOR SEQ ID NO:24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
AAAACCTATG GTTACAATTC 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

TCCTAGACAT GGTTCAAGTG 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GATATAATTA GTTCTCCATC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:
ATGCCTGTTC CAGGCTGCAC 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
GGACGGCGAC CTCCACCCAC 

(2) INFORMATION FOR SEQ ID NO:29:



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:
GGGCTCCTCC GACGCCTGAG 

(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:
AGTCTAGCCC TGGCCTTGAC 

(2) INFORMATION FOR SEQ ID NO:31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:

GTCACTGGGG ACTCCGGCAG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:
CAGCTTTCCC TGGGCACATG 

(2) INFORMATION FOR SEQ ID NO:33:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CACAGCTGTC TCAAGCCCAG 

(2) INFORMATION FOR SEQ ID NO:34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:



AC7GTTCCCC CTACATGATG 



(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ATCATATCCT CTTGCTGGTC 

(2) INFORMATION FOR SEQ ID NO:36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:

GTTCCCAGAG CTTGTCTGTG 

(2) INFORMATION FOR SEQ ID NO:37:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:

GTTTGGCAGA CTCATAGTTG 

(2) INFORMATION FOR SEQ ID NO:38:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:
TAGCAGGGAG CCATGACCTG 

(2) INFORMATION FOR SEQ ID NO:39:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:
CTTGGCGCCA GAAGCGAGAG 

(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:
CCTCTCTCTC TCTCTCTCTC 

(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:
TCCCCGCTGA TTCCGCCAAG 

(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
CTTTTTGAAT TCGGCACGAG 

(2) INFORMATION FOR SEQ ID NO:43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CCCCTGGTCC GCACCAGTTC 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GAGAAGGGTC GGGGCGGCAG 

(2) INFORMATION FOR SEQ ID NO:45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
AAATCACATC GCGTCAACAC 

(2) INFORMATION FOR SEQ ID NO: 46: 


(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:

TAAGAGAGTC ATAGTTACTC 

(2) INFORMATION FOR SEQ ID NO:47:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:
GCTCTAGAAG TACTCTCGAG 

(2) INFORMATION FOR SEQ ID NO:48:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:
ACTCTGGCCA TCAGGAGATC 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:
CAGGCGTTGT AGATGTTCTG 

(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:

AGTGGCAGGC AGAAGTAATG 

(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 
GGTTGGAGAA CTGGATGTAG 

(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:

CTATTCAGAT GCAACGCCAG 

(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:

CCATGGCACA CAGAGCAGAC 

(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54:
GCTACCATGC AGAGACACAG 

(2) INFORMATION FOR SEQ ID NO:55:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:

CAGGCTGACA AGAAAATCAG 

(2) INFORMATION FOR SEQ ID NO:56:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GGCACGCATA GAGGAGAGAC

(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:
TGGGTGATGC CTTT6CTGAC 

(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:

AAAACAAGAT CAAGGTGATG 

(2) INFORMATION FOR SEQ ID NO:59:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 
TTGCCCACAT TGCTATGGTG 

(2) INFORMATION FOR SEQ ID NO:60:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

GACCAAGATC AGAAGTAGAG 

(2) INFORMATION FOR SEQ ID NO:61: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:

CCCCTGGGCC AATGATGTTG 

(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:



TCTTCCCACC ATAGCAATG 

(2) INFORMATION FOR SEQ ID NO:63:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:
TGGTCTTGGT GACCAATGTG 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:
ACACCTCGGT GACCCCTGTG 

(2) INFORMATION FOR SEQ ID NO:65:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 
TCTCCAAGTT CGGCACAGTG 

(2) INFORMATION FOR SEQ ID NO:66:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:
ACATGGGCTG CACTCACGAC 

(2) INFORMATION FOR SEQ ID NO:67:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 
GATCCTCTGA ACCTGCAGAG 

(2) INFORMATION FOR SEQ ID NO:68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:

GGAAATGAGG TGGGGCGATC 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69:
CTTTGCCTTG GACAAGGATG 

(2) INFORMATION FOR SEQ ID NO:70:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:
GCACCTGCCA TTGGGGGTAG 

(2) INFORMATION FOR SEQ ID NO:71:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:
GGTGGAAGCC ATTGACGGTG 

(2) INFORMATION FOR SEQ ID NO:72:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:
TGCGTCTCTC GTCGCTGCTG 

(2) INFORMATION FOR SEQ ID NO:73:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:
GCGGAAACTC TGTGGTGCTG 

(2) INFORMATION FOR SEQ ID NO: 74: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:
AGGATTGCCT TCCTCTACTG 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75:
TGTCTGTTTC ACCAGGGCAG 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:

CCAGTGCCTC TATGCATGTC 

(2) INFORMATION FOR SEQ ID NO:77:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:

AGGAAGCCCA CGCACACCAC 

(2) INFORMATION FOR SEQ ID NO:78:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:
CCCTTTGTTC CCTGATCTTC 

(2) INFORMATION FOR SEQ ID NO:79:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:
CGCTCGGGAT CCAGGTCATC



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
TCGAGGTTCA GAGCGTAGTG 

(2) INFORMATION FOR SEQ ID NO:81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81:
TCTTGGATCT CTGGCACCTC 

(2) INFORMATION FOR SEQ ID NO:82:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

CCATCAGAGT GAAGGAGGAG 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

CCATCTTCCA CTGGTCAGAG 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84:

CTCCTTCTCT TGGATCTCTG 

(2) INFORMATION FOR SEQ ID NO:85: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85:
TTACTTCAGC ACTGTTAGTC 

(2) INFORMATION FOR SEQ ID NO:86:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
AGGGAGGTAG CTCAAAGCTC 

(2) INFORMATION FOR SEQ ID NO:87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87:

TGGGTCCACA GTTCGCACAG 

(2) INFORMATION FOR SEQ ID NO:88:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88:
CAACTCTGTG ATGGCTCCAG 

(2) INFORMATION FOR SEQ ID NO:89:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 
AGCAGGGTTC TGTTCAAGAC 

(2) INFORMATION FOR SEQ ID NO:90:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90:
CCATTGGGTG CTAGTCTCTC 



(2) INFORMATION FOR SEQ ID NO:91:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 
CAGCCATGCT GTCCCAGCAG 

(2) INFORMATION FOR SEQ ID NO:92: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92:

CTGGACCTGA GGTAGCGCTG 

(2) INFORMATION FOR SEQ ID NO:93: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93:
ATAACCACCC TGAGGCACTG 

(2) INFORMATION FOR SEQ ID NO:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 
CCTGCAGGTC GACACTAGTG 

(2) INFORMATION FOR SEQ ID NO:95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
AATTGGAATG AGGAGGACTG 

(2) INFORMATION FOR SEQ ID NO:96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96:
GCTCTAGAAG TACTCTCGAG

(2) INFORMATION FOR SEQ ID NO:97:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: 

ATTGTATGAC AATGCACCAG 

(2) INFORMATION FOR SEQ ID NO:98: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 
TCCACAGAGG GCTTCATCAC 

(2) INFORMATION FOR SEQ ID NO:99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99:
CCTGACTGGC CTAAGCACAG 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
AAGCCTCATA ACCACCAGTG 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TGTCAACGGT GACAAGTGTG 

(2) INFORMATION FOR SEQ ID NO: 102: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
TTGTACACCA GCTGCAGGTC 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103:
GGGTGTGGTG CAGATGAGTC 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
ATCACACTCT TATAGCTCAG 

(2) INFORMATION FOR SEQ ID NO:105: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:105:
GTGGGAAGCT TTCCTCAGAC 

(2) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106:
TGATGAACAT GGGCCTGGAG 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 



CATTGTGGAT GTACTACCAC 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:108:
TGTGTTTTGC AACCTGAGTG 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
ATAGTGGCAC CACTTACGAG 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:110:

AATTCTGCAA CGTGATGGCG 

(2) INFORMATION FOR SEQ ID NO:111:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:111:
CACAAGATGC CTCGTCTGTG 

(2) INFORMATION FOR SEQ ID NO:112:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:112:
AATCCGGACA AGGTACAGTC 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

GCACGAGTGG CACAAGCGTG 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
GCAAGCGTGT GGTGTCAGTG 

(2) INFORMATION FOR SEQ ID NO:115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
TGTTTGAACA GGCTCTGGAC 

(2) INFORMATION FOR SEQ ID NO:116: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:116:
CGGCATGGCA ATGAGGACAC 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
AGGACGAGAT GGACCTCCAG 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
CCCTCTGTCC TCTAGCCCAC 

(2) INFORMATION FOR SEQ ID NO:119:


(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
TCTTGAGGGG ACTGACTCTG 

(2) INFORMATION FOR SEQ ID NO:120:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:
TGAGTGAGGA GGCAGATGTC 

(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
TGGCTTTGAA GAAAGAGCTG 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:
GCAAAAGACC AGGCTGACTG 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
TGCAGCTCCT TGGTCTTCTC 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:



GATTCACAGT CCCAAGGCTC 

(2) INFORMATION FOR SEQ ID NO:125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 
ATCTGGATGA GGCGGTTGAG 

(2) INFORMATION FOR SEQ ID NO:126:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126:
GGTCACTCTC CGACGAGGAG 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:
GGATCCAAAG TTCGTCTCTG 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128:
CGCTGTGTGT CTGATCCCTC 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:129:
ATGAAGGTAA ACCCCGGGAG 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130:
TGGTCTCTGG CTCTGAGCAC 

(2) INFORMATION FOR SEQ ID NO:131:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:131: 
GCCTGGAGAA GCCCAGTCTG 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
CACACTCTGG ACCGTTGCTG 

(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
AAAGCTCCGC AGCCGCAGTG 

(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
TCTTCCAGGA AGCTGCGGTC 

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:135:
GATGGTGGGG CAGCATTGAG 

(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:136:
GTCACCAGTG GTGCCTGCAG 

(2) INFORMATION FOR SEQ ID NO: 137:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:137:
ACCTCACGGT TGCCAACCTG 

(2) INFORMATION FOR SEQ ID NO:138:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
CGCAACAGCG TCTCCCTCTG 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
AGTACCTTCA TAAGTTCTTC 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:140:
TCCCAGACTT CAACCTTCAC 

(2) INFORMATION FOR SEQ ID NO:141:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:141:
AAACATCTTC CCGGTCGGAC 

(2) INFORMATION FOR SEQ ID NO:142:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 
GCTGAGCACC TTTACCTCAC 

(2) INFORMATION FOR SEQ ID NO:143:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
GACGTCCGTC CGGGAAGATG 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:144: 
ACACAGGAGA TGCAGGTCAC 

(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145:
GAGTCTTCCA TGAAGAACAG 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:
GCAGTGAGGA AGGTAAGGAG 

(2) INFORMATION FOR SEQ ID NO: 147: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
(B) LOCATION: 378...1799
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

GGATCCAAAG GACGCCCCCG CCGACAGGAG AATTGGTTCC CGGGCCCGCG GCGATGCCCC 60 

CCCGGTAGCT CGGGCCCGTG GTCGGGTGTT TGTGAGTGTT TCTATGTGGG AGAAGGAGGA 120 

GGAGGAGGAA GAAGAAGCAA CGATTTGTCT TCTCGGCTGG TCTCCCCCCG GCTCTACATG 180 

TTCCCCGCAC TGAGGAGACG GAAGAGGAGC CGTAGCCGCC CCCCCTCCCG GCCCGGATTA 240 

TAGTCTCTCG CCACAGCGGC CTCGGCCTCC CCTTGGATTC AGACGCCGAT TCGCCCAGTG 300 

TTTGGGAAAT GGGAAGTAAT GACAGCTGGC ACCTGAACTA AGTACTTTTA TAGGCAACAC 360 

CATTCCAGAA ATTCAGG ATG AAT GGG GAT ATG CCC CAT GTC CCC ATT ACT 410
Met Asn Gly Asp Met Pro His Val Pro Ile Thr
1 5 10

ACT CTT GCG GGG ATT GCT ACT CTC ACA GAC CTC CTG AAC CAG CTG CCT 458
Thr Leu Ala Gly Ile Ala Ser Leu Thr Asp Leu Leu Asn Gln Leu Pro
15 20 25

CTT CCA TCT CCT TTA CCT GCT ACA ACT ACA AAG AGC CTT CTC TTT AAT 506
Leu Pro Ser Pro Leu Pro Ala Thr Thr Thr Lys Ser Leu Leu Phe Asn

GCA CGA ATA GCA GAA GAG GTG AAC TGC CTT TTG GCT TGT AGG GAT GAC 554
Ala Arg Ile Ala Glu Glu Val Asn Cys Leu Leu Ala Cys Arg Asp Asp
45 50 55

AAT TTG GTT TCA CAG CTT GTC CAT AGC CTC AAC CAG GTA TCA ACA GAT 602
Asn Leu Val Ser Gln Leu Val His Ser Leu Asn Gln Val Ser Thr Asp
60 65 70 75



CAC ATA GAG TTG AAA GAT AAC CTT GGC AGT GAT GAC CCA GAA GGT GAC 650
His Ile Glu Leu Lys Asp Asn Leu Gly Ser Asp Asp Pro Glu Gly Asp
80 85 90

ATA CCA GTC TTG TTG CAG GCC GTC CTG GCA AGG AGT CCT AAT GTT TTC 698
Ile Pro Val Leu Leu Gln Ala Val Leu Ala Arg Ser Pro Asn Val Phe
95 100 105

AGG GAG AAA AGC ATG CAG AAC AGA TAT GTA CAA AGT GGA ATG ATG ATG 746
Arg Glu Lys Ser Met Gln Asn Arg Tyr Val Gln Ser Gly Met Met Met
110 115 120

TCT CAG TAT AAA CTT TCT CAG AAT TCC ATG CAC AGT AGT CCT GCA TCT 794
Ser Gln Tyr Lys Leu Ser Gln Asn Ser Met His Ser Ser Pro Ala Ser
125 130 135

TCC AAT TAT CAA CAA ACC ACT ATC TCA CAT AGC CCC TCC AGC CGG TTT 842
Ser Asn Tyr Gln Gln Thr Thr Ile Ser His Ser Pro Ser Ser Arg Phe
140 145 150 155

GTG CCA CCA CAG ACA AGC TCT GGG AAC AGA TTT ATG CCA CAG CAA AAT 890
Val Pro Pro Gln Thr Ser Ser Gly Asn Arg Phe Met Pro Gln Gln Asn
160 165 170

AGC CCA GTG CCT AGT CCA TAC GCC CCA CAA AGC CCT GCA GGA TAC ATG 938
Ser Pro Val Pro Ser Pro Tyr Ala Pro Gln Ser Pro Ala Gly Tyr Met
175 180 185

CCA TAT TCC CAT CCT TCA AGT TAC ACA ACA CAT CCA CAG ATG CAA CAA 986
Pro Tyr Ser His Pro Ser Ser Tyr Thr Thr His Pro Gln Met Gln Gln
190 195 200

GCA TCG GTA TCA AGT CCC ATT GTT GCA GGT GGT TTG AGA AAC ATA CAT 1034
Ala Ser Val Ser Ser Pro Ile Val Ala Gly Gly Leu Arg Asn Ile His
205 210 215

GAT AAT AAA GTT TCT GGT CCG TTG TCT GGC AAT TCA GCT AAT CAT CAT 1082
Asp Asn Lys Val Ser Gly Pro Leu Ser Gly Asn Ser Ala Asn His His
220 225 230 235

GCT GAT AAT CCT AGA CAT GGT TCA AGT GAG GAC TAC CTA CAC ATG GTG 1130
Ala Asp Asn Pro Arg His Gly Ser Ser Glu Asp Tyr Leu His Met Val
240 245 250

CAC AGG CTA AGT AGT GAC GAT GGA GAT TCT TCA ACA ATG AGG AAT GCT 1178
His Arg Leu Ser Ser Asp Asp Gly Asp Ser Ser Thr Met Arg Asn Ala
255 260 265

GCA TCT TTT CCC TTG AGA TCT CCA CAG CCA GTA TGC TCC CCT GCT GGA 1226
Ala Ser Phe Pro Leu Arg Ser Pro Gln Pro Val Cys Ser Pro Ala Gly
270 275 280

AGT GAA GGA ACT CCT AAA GGC TCA AGA CCA CCT TTA ATC CTA CAA TCT 1274
Ser Glu Gly Thr Pro Lys Gly Ser Arg Pro Pro Leu Ile Leu Gln Ser
285 290 295

CAG TCT CTA CCT TGT TCA TCA CCT CGA GAT GTT CCA CCA GAT ATC TTG 1322
Gln Ser Leu Pro Cys Ser Ser Pro Arg Asp Val Pro Pro Asp Ile Leu
300 305 310 315

CTA GAT TCT CCA GAA AGA AAA CAA AAG AAG CAG AAG AAA ATG AAA TTA 1370
Leu Asp Ser Pro Glu Arg Lys Gln Lys Lys Gln Lys Lys Met Lys Leu
320 325 330

GGC AAG GAT GAA AAA GAG CAG AGT GAG AAA GCG GCA ATG TAT GAT ATA 1418
Gly Lys Asp Glu Lys Glu Gln Ser Glu Lys Ala Ala Met Tyr Asp Ile
335 340 345

ATT AGT TCT CCA TCC AAG GAC TCT ACT AAA CTT ACA TTA AGA CTT TCT 1466
Ile Ser Ser Pro Ser Lys Asp Ser Thr Lys Leu Thr Leu Arg Leu Ser
350 355 360

CGT GTA AGG TCT TCA GAC ATG GAC CAG CAA GAG GAT ATG ATT TCT GGT 1514
Arg Val Arg Ser Ser Asp Met Asp Gln Gln Glu Asp Met Ile Ser Gly
365 370 375

GTG GAA AAT AGC AAT GTT TCA GAA AAT GAT ATT CCT TTT AAT GTG CAG 1562
Val Glu Asn Ser Asn Val Ser Glu Asn Asp Ile Pro Phe Asn Val Gln
380 385 390 395

TAC CCA GGA CAG ACT TCA AAA ACA CCC ATT ACT CCA CAA GAT ATA AAC 1610
Tyr Pro Gly Gln Thr Ser Lys Thr Pro Ile Thr Pro Gln Asp Ile Asn
400 405 410

CGC CCA CTA AAT GCT GCT CAA TGT TTG TCG CAG CAA GAA CAA ACA GCA 1658
Arg Pro Leu Asn Ala Ala Gln Cys Leu Ser Gln Gln Glu Gln Thr Ala
415 420 425

TTC CTT CCA GCA AAT CAA GTG CCT GTT TTA CAA CAG AAC ACT TCA GTT 1706
Phe Leu Pro Ala Asn Gln Val Pro Val Leu Gln Gln Asn Thr Ser Val
430 435 440

GCT GCA AAA CAA CCC CAG ACC AAT AGT CAC AAA ACC TTG GTG CAG CCT 1754
Ala Ala Lys Gln Pro Gln Thr Asn Ser His Lys Thr Leu Val Gln Pro
445 450 455

GGA ACA GGC ATA GAG GTC TCA GCA GAG CTG CCC AAG GAC AAG ACC TAAGA 1804
Gly Thr Gly Ile Glu Val Ser Ala Glu Leu Pro Lys Asp Lys Thr
460 465 470

TCCAGCAGGG AACTATGTAG TCACCCCGAG AGGCCCAGCT CTCTCCGTGA GCTCTGGGCC 1864 

TAGGGTGGGG GTGGTTGTTG GTTCTGCGCG CACTGTTCCC PCTACATGAT GGGTCCATCC 1924

CAGTTGGCTT CTCTCACTCG CTTCCTCCTG TGGAGAAGCC TGTCCAGGTG TMCTGCCTC 1984 

CAGGAAGCTG TCTCTGATTT CTCCAGTTGA ACAGTGAGAT TTGCCACACC TCACATGCAT 2044 

CGCTCTTGTC CCTGGAATTG TAACCATAGG TTTTCCTGTC TCCTGGAGGA CAAGGATGAG 2104
GGCTTTCCAC TTGAGTCTCC CTGGTGGAGC CCAGCTCCTG ACATACCTGG TAAAAGTTCT 2164
CaSSgA ACATGGAGGA GGAATGTGGA TAACAACCCT GGCTGCCTGT GTGTTCCAAG 2224 
CTAGGAAGAT GTAATGTCCC CACAAACGGG GTAAATGGCT TGCCTGCGTC ACAGCTGTCT 2284 

cISgS CCCTGGGCGC CAGCCCAAGC ccaaggacta ggtccagagc cacacagcgc 2344 

CAGGCCACAT CCGCCTCACC TGGGACCCTT TGTGGGGTAC AGTCTCCGGC CCCACCCAGA 2404 

CCTCCTGAAG GAGAGACCCC ATGGCAAGGA CTCAGCCACC TGCAGTTTCA TAAGCCCCCA 2464 

rTGGGTTCCT AGGCATGAAG ACCACCGGTT AGAGGCTGAA CTGGCAGGAA CCTGTCTCCA 2524

gJcCCHCK ^Sg GGCCCTGCCT CAGAGGCAGC ACCCAGGACG TGGCCATGAC 2584 

CCgSSc CACTCAATCC CTCTTCTCCA GGAGCCATGC AAAGTGTCAG CCAGCCAGGC 2644 

CCCTGGAAGG CAGTCATCAC CTCTTAAGGC ATTGTGGGTG TCGGTCCTGC AACTGCCAGG 2704 

?gcIgCACAC GACCCGTGTC CGGTGTTCGA TAGCAGGGAG CCATGACCTG GCAACGATTC 2764 

MCGC?C^ GGGGCACCCG GGGGGCCCTG GGTCGGGGCG GATCAGCTTT CCCTGGGCAC 2824 

S?CTGCCTCA TTCCAGATCT CCAGGGCTCA TGTCTGTGAC AGGGAGGGAA GGCTCTGCCC 2884 

?GGCCTTCCG TCAGCTCTGC CAGTGCAGGC TGGGCAGCCT GGGCTTTAGA GCTGGCTTCT 2944 

GCCCACACTT TCTCCGTGAA AGGAAAACAA CTATGAGTCT GCCAAACGCA TCTCAGATGC 3004 

G?mAAAAA ATTCTGGTCC CCGCTCTCTG TCCCATCATC CGCCTCGGGG ACTTCCTCTC 3064 

TrrrxrGTTC TCACCCCATA CTCTGTCACT GCCACATTTT CACCTGGGCC TGGCCTTTGT 3124 

££££££ AA^S AATCTTGAAA TGGATTTCTA GGTCACTGGG GACTCCGGCA 3184 

GCACATTCGG "tCAGAATA AAGGGCGCCC GCGGTCCCCC AGCACCTCCC CAAGCCACAC 3244 

CCCTAGCTTC CCTCCCTATC CCTGCAGCCT GAGGGTCCCT TCAGCCACCC TTAAGTCCCC 3304 

CTGCCCCGCC CCTGGCTAGC AGCGCCTTCT CCACCGGGGC CCCCTCTGCT 3364 

CACAGAGCCC CCTCACCTCC CTGGGGATGA GGGGCCAGGC CATWCCCTG AWGTCTAGC 3424 
CCTGGCCTTG ACCTCCCAGG AGCGCCCTCC CCGCCCTCTC CCGGCGCCGG CCCCGTCCTC 3484

SScKc CTCTGGGTCG TGCCCCGCAG ACTGAGCTGC GCTTGGGGGT CCTGGCGGCC 3544 

TGGGCCGTCC CGCACCGAAC CCAGGCGGTC GGAGCCCGGC GGGAAGGCGC GAGGTCCTTC 3604

TGGGGGCTCC TCCGACGCCT GAGGGCGCTG CTTCCCCGCG GCCGCCCCGG GTTTCTGCGG 3664

AGCCGGGGCC TCCGCTCTCG GGTGACCCGG TGAGACCCCC GGGGAGGCCG CTGGGGAGGC 3724

GCGGGCTCTG CTCCCGGGTC CCAAACGCAC TGGCTGCCCC TCAGGAGGGA CGGCGACCTC 3784

CACCCACGGC GCTGGCGCCC GCACGGCCGC TCCTCCCGCT CCCGCAGCCT MCGCCTCC 3844 

CGAGGCCGCC CCGCCGGGCC CCACGCGCGG CCCCATCCGC AGGCCAGGAC TGCCTTCCCG 3904

GAGCTGGCGG CCCCCAGCCT GGAGGAGCCG GCCCCAGACG CCCTCCCAGC CCTCCCCAGC 3964 

CCACTCTGGC CCCGCAGCCC CCGCCTGGTC CGAGTGCGGG TCTCTGGCCC CGGCCTTTCC 4024
CGGGGAAGGA AAGCAAAAAG CTT 4047

(2) INFORMATION FOR SEQ ID NO: 148:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 474 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:

Met Asn Gly Asp Met Pro His Val Pro Ile Thr Thr Leu Ala Gly Ile

Ala Ser Leu Thr Asp Leu Leu Asn Gln Leu Pro Leu Pro Ser Pro Leu
20 25

Pro Ala Thr Thr Thr Lys Ser Leu Leu Phe Asn Ala Arg Ile Ala Glu
35 40 45

Glu Val Asn Cys Leu Leu Ala Cys Arg Asp Asp Asn Leu Val Ser Gln
50 55 60

Leu Val His Ser Leu Asn Gln Val Ser Thr Asp His Ile Glu Leu Lys
65 70 75 80

Asp Asn Leu Gly Ser Asp Asp Pro Glu Gly Asp Ile Pro Val Leu Leu
85 90 95

Gln Ala Val Leu Ala Arg Ser Pro Asn Val Phe Arg Glu Lys Ser Met
100 105 110

Gln Asn Arg Tyr Val Gln Ser Gly Met Met Met Ser Gln Tyr Lys Leu
115 120 125

Ser Gln Asn Ser Met His Ser Ser Pro Ala Ser Ser Asn Tyr Gln Gln
130 135 140

Thr Thr Ile Ser His Ser Pro Ser Ser Arg Phe Val Pro Pro Gln Thr
145 150 155 160

Ser Ser Gly Asn Arg Phe Met Pro Gln Gln Asn Ser Pro Val Pro Ser
165 170 175

Pro Tyr Ala Pro Gln Ser Pro Ala Gly Tyr Met Pro Tyr Ser His Pro
180 185 190

Ser Ser Tyr Thr Thr His Pro Gln Met Gln Gln Ala Ser Val Ser Ser
195 200 205

Pro Ile Val Ala Gly Gly Leu Arg Asn Ile His Asp Asn Lys Val Ser
210 215 220

Gly Pro Leu Ser Gly Asn Ser Ala Asn His His Ala Asp Asn Pro Arg
225 230 235 240

His Gly Ser Ser Glu Asp Tyr Leu His Met Val His Arg Leu Ser Ser
245 250 255

Asp Asp Gly Asp Ser Ser Thr Met Arg Asn Ala Ala Ser Phe Pro Leu
260 265 270

Arg Ser Pro Gln Pro Val Cys Ser Pro Ala Gly Ser Glu Gly Thr Pro
275 280 285

Lys Gly Ser Arg Pro Pro Leu Ile Leu Gln Ser Gln Ser Leu Pro Cys
290 295 300

Ser Ser Pro Arg Asp Val Pro Pro Asp Ile Leu Leu Asp Ser Pro Glu
305 310 315 320

Arg Lys Gln Lys Lys Gln Lys Lys Met Lys Leu Gly Lys Asp Glu Lys
325 330 335

Glu Gln Ser Glu Lys Ala Ala Met Tyr Asp Ile Ile Ser Ser Pro Ser
340 345

Lys Asp Ser Thr Lys Leu Thr Leu Arg Leu Ser Arg Val Arg Ser Ser
355 360 365

Asp Met Asp Gln Gln Glu Asp Met Ile Ser Gly Val Glu Asn Ser Asn
370 375 380

Val Ser Glu Asn Asp Ile Pro Phe Asn Val Gln Tyr Pro Gly Gln Thr
385 390 395

Ser Lys Thr Pro Ile Thr Pro Gln Asp Ile Asn Arg Pro Leu Asn Ala
405 410 415

Ala Gln Cys Leu Ser Gln Gln Glu Gln Thr Ala Phe Leu Pro Ala Asn
420 425

Gln Val Pro Val Leu Gln Gln Asn Thr Ser Val Ala Ala Lys Gln Pro
435 440 445

Gln Thr Asn Ser His Lys Thr Leu Val Gln Pro Gly Thr Gly Ile Glu
450 455 460

Val Ser Ala Glu Leu Pro Lys Asp Lys Thr
465 470

(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2998 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 26...799
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:149:

AAGCTTTTTG AATTCGGCAC GAGAT GCT ACA CJ0 GCT AT* TTT GAA ATA CTG 52
Ala Thr Gln Ala Ile Phe Glu Ile Leu
1 5

GAG AAA TCC TGG TTG CCC CAG AAT TGT ACA CTG GTT GAT ATG AAG ATT 100
Glu Lys Ser Trp Leu Pro Gln Asn Cys Thr Leu Val Asp Met Lys Ile
10 15 20

GAA TTT GGT GTT GAT GTA ACC ACC AAA GAA ATT GTT CTT GCT GAT GTT 148
Glu Phe Gly Val Asp Val Thr Thr Lys Glu Ile Val Leu Ala Asp Val
30 35 40

ATT GAC AAT GAT TCC TGG AGA CTC TGG CCA TCA GGA GAT CGA AGC CAA 196
Ile Asp Asn Asp Ser Trp Arg Leu Trp Pro Ser Gly Asp Arg Ser Gln
45 50 55

CAG AAA GAC AAA CAG TCT TAT CGG GAC CTC AAA GAA GTA ACT CCT GAA 244
Gln Lys Asp Lys Gln Ser Tyr Arg Asp Leu Lys Glu Val Thr Pro Glu
60 65 70

GGG CTC CAA ATG GTA AAG AAA AAC TTT GAG TGG GTT GCA GAG AGA GTA 292
Gly Leu Gln Met Val Lys Lys Asn Phe Glu Trp Val Ala Glu Arg Val
75 80 85

GAG TTG CTT TTG AAA TCA GAA ACT CAG TGC AGG GTT GTA GTG TTG ATG 340
Glu Leu Leu Leu Lys Ser Glu Ser Gln Cys Arg Val Val Val Leu Met
90 95 100 105

GGC TCT ACT TCT GAT CTT GGT CAC TGT GAA AAA ATC AAG AAG GCC TGT 388
Gly Ser Thr Ser Asp Leu Gly His Cys Glu Lys Ile Lys Lys Ala Cys
110 115 120

GGA AAT TTT GGC ATT CCA TGT GAA CTT CGA GTA ACA TCT GCG CAT AAA 436
Gly Asn Phe Gly Ile Pro Cys Glu Leu Arg Val Thr Ser Ala His Lys
125 130 135

GGA CCA GAT GAA ACT CTG AGG ATT AAA GCT GAG TAT GAA GGG GAT GGC 484
Gly Pro Asp Glu Thr Leu Arg Ile Lys Ala Glu Tyr Glu Gly Asp Gly
140 145 150

ATT CCT ACT GTA TTT GTG GCA GTG GCA GGC AGA AGT AAT GGT TTG GGA 532
Ile Pro Thr Val Phe Val Ala Val Ala Gly Arg Ser Asn Gly Leu Gly
155 160 165

CCA GTG ATG TCT GGG AAC ACT GCA TAT CCA GTT ATC AGC TGT CCT CCC 580
Pro Val Met Ser Gly Asn Thr Ala Tyr Pro Val Ile Ser Cys Pro Pro
170 175 180 185

CTC ACA CCA GAC TGG GGA GTT CAG GAT GTG TGG TCT TCT CTT CGA CTA 628
Leu Thr Pro Asp Trp Gly Val Gln Asp Val Trp Ser Ser Leu Arg Leu
190 195 200

CCC AGT GGT CTT GGC TGT TCA ACC GTA CTT TCT CCA GAA GGA TCA GCT 676
Pro Ser Gly Leu Gly Cys Ser Thr Val Leu Ser Pro Glu Gly Ser Ala
205 210 215

CAA TTT GCT GCT CAG ATA TTT GGG TTA AGC AAC CAT TTG GTA TGG AGC 724
Gln Phe Ala Ala Gln Ile Phe Gly Leu Ser Asn His Leu Val Trp Ser
220 225 230

AAA CTG CGA GCA AGC ATT TTG AAC ACA TGG ATT TCC TTG AAG CAG GCT 772
Lys Leu Arg Ala Ser Ile Leu Asn Thr Trp Ile Ser Leu Lys Gln Ala
235 240 245

GAC AAG AAA ATC AGA GAA TGT AAT TTA TAAGAAAGAA TGCCATTGAA TTTTTTA 826
Asp Lys Lys Ile Arg Glu Cys Asn Leu
250 255

GGGGAAAAAC TACAAATTTC TAATTTAGCT GAAGGAAAAT CAAGCAAGAT GAAAAGGTAA 886 

SSJEJE GAGAACACAA ATAAAATGTA TTAGTGAATA AATGGTGAGG GTAGGCCTAT 946 

TCAGATGCAA GGCCAGCAAT GGGGCTCCCC ATTATCCCCA CCCCTTTGGT CCCAGTCCCC 1006

TTCTCTGCAA TGGGCACGCA TAGAGGAGAG ACAAAGGGTA TTAGACGCAA CATCATTGGC 1066
CCAGGGGAGT CCGAGAAGAG CTGCCATTGG CTGACAGGGC ATTTTCAGGC TCTGTCATTG 1126

GTCAGGGAGC ACACCCCAGC CTGAAGAGTG ATGCCATTGG CCAGGGAGTG GTTTTGTCAT 1186

S^GC mGAAGTGG AAGGAAAAGA TCTGGGAATG AAGCCCTGTG gccaggaaga 1246

TAGACAGGGC AGCAACTTCT GGGCCTCCAG GCCCTCTTCC CACCATAGCA ATGTGGGCAA 1306

AACTGGTGTC AGGCCCCAGC CAGAAAAAGG AGCCCAAGCC agagggcaag tgacaaagga 1366

TGTACCATGT CCAATCTCCC ACACCCTGGG GCTGCCCTTC CCAATGTCTT TCTTGATAGC 1426 

CAAGTTGGGC TGGGAGCAGC TCACTGCTCC TCTAGCCAGG AGGGTTTCTC agctcctgga 1486

GGCCGCAGCT TGATGTTGAA CTGCTGCAGG GTCTGCTCCA GCTGTTTCTG GTTCCCAGCA 1546

AAGTAGGCGG ACACAGCATT GTGGAAGAGC AGCAGCTGCT tgtgcatcac cttgatcttg 1606

TTTTCTTCCA GGAACTTGAG CTTGATGGCC ACATCTCCCC GCAGCTTCTC ATACTTGTCC 1666

CGATGGGCCT GGAAAGTGGC CTGGGCACTC TCAAGTCGAC CACGTGTCCC tgcatcccgg 1726

GGGCCTAGAC TCAGCTCCTC TAAGTCTGTT CGGTAGGCAT CATATTCCAG CCTGGCAGCC 1786

TCATACTGTT TCACAGTCAT GAGCGTGTCT TCCATGGTCT TGGTGACCAA TGTGTTGATG 1846

JtAGAGaSS AGAAGTTCAC GGCTCCTAGC AGCGTTTCCC CATTCTTGCA TAGTAGTTTC 1906

TGTGTCTCTG WvlTcC AAATTCCTCC TGAAGCTCTG GGGACTTCTG GCTGAGGTCA 1966 

GCAAAGGCAT CACCCAGTGC ATGCTGGGTC TGCAGCAGGC TGTAGAGGTG "CTCTCAGT 2026 

GCCCGGCCCA GCTGCAGGAC ACTCTCATAC TTGCGCTTCG TCTCACGCAG CAACTCAATC 2086 

TGCAGCTCTA GCTCCAGGAT TCCGGCGCCT CCACTCCGTC CCCCGCGGGT CTGCTCTGTG 2146 

TGCCATGGAC GGCATTGTCC CAGATATAGC CGTTGGTACA AAGCGGGGAT CTGACGAGCT 2206 

TTTCTCTACT TGTGTCACTA ACGGACCGTT TATCATGAGC AGCAACTCGG CTTCTGCAGC 2266 

AaIcGG^AT GACAGCAAGA AGTTCAAAGG TGACAGCCGA AGTGCAGGCG TCCCCTCTAG 2326 

AGTGATCCAC ATCCGGAAGC TCCCCATCGA CGTCACGGAG GGGGAAGTCA TCTCCCTGGG 2386 
rrTGCCCTTT GGGAAGGTCA CCAACCTCCT GATGCTGAAG GGGAAAAACC AGGCCTTCAT 2446
^tgmc ACGGAGGAGG CTGCCAATAC CATGGTGAAC TACTACACCT CGGTGACCCC 2506 

5S£E£m GGCCAGCCCA tctacatcca gttctccaac cacaaggagc tgaagaccga 2566 

CAGCTCTCCC AACCAGGCGC GGGCCCAGGC GGCCCTGCAG GCGGTGAACT CGGTCCAGTC 2626 

gg™c?g GCCTTGGCTG cctcggcggc ggccgtggat gcagggatgg cgatggccgg 2686 

GCAGAGCCCC GTGCTCAGGA TCATCGTGGA GAACCTCTTC TACCCTGTGA CCCTGGATGT 2746 

>rTrr!rrAfi attttctCCA AGTTCGGCAC AGTGTTGAAG ATCATCACCT TCACCAAGAA 2806 

£S ££S StATGC GGACCCCGTG AGCGCCCAGC ACGCCAAGCT 2866 

nrrrrmr RRRrAfiAACA TCTACAACGC CTGCTGCACG CTGCGCATCG ACTTTTCCAA 2926

£lSc C?cS AGTACAACAA TGACAAGAGC CGTGACTACC TCGTGCCGAA 2986 
TTCTTTGGAT CC 2998

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

Ala Thr Gln Ala Ile Phe Glu Ile Leu Glu Lys Ser Trp Leu Pro Gln
1 5 10 15

Asn Cys Thr Leu Val Asp Met Lys Ile Glu Phe Gly Val Asp Val Thr
20 25

Thr Lys Glu Ile Val Leu Ala Asp Val Ile Asp Asn Asp Ser Trp Arg
35 40 45

Leu Trp Pro Ser Gly Asp Arg Ser Gln Gln Lys Asp Lys Gln Ser Tyr
50 55 60

Arg Asp Leu Lys Glu Val Thr Pro Glu Gly Leu Gln Met Val Lys Lys

Asn Phe Glu Trp Val Ala Glu Arg Val Glu Leu Leu Leu Lys Ser Glu
85 90 95

Ser Gln Cys Arg Val Val Val Leu Met Gly Ser Thr Ser Asp Leu Gly
100 105 110

His Cys Glu Lys Ile Lys Lys Ala Cys Gly Asn Phe Gly Ile Pro Cys
115 120 125

Glu Leu Arg Val Thr Ser Ala His Lys Gly Pro Asp Glu Thr Leu Arg
130 135 140

Ile Lys Ala Glu Tyr Glu Gly Asp Gly Ile Pro Thr Val Phe Val Ala
145 150 155 160

Val Ala Gly Arg Ser Asn Gly Leu Gly Pro Val Met Ser Gly Asn Thr
165 170 175

Ala Tyr Pro Val Ile Ser Cys Pro Pro Leu Thr Pro Asp Trp Gly Val
180 185 190

Gln Asp Val Trp Ser Ser Leu Arg Leu Pro Ser Gly Leu Gly Cys Ser
195 200 205

Thr Val Leu Ser Pro Glu Gly Ser Ala Gln Phe Ala Ala Gln Ile Phe
210 215 220

Gly Leu Ser Asn His Leu Val Trp Ser Lys Leu Arg Ala Ser Ile Leu
225 230 235 240

Asn Thr Trp Ile Ser Leu Lys Gln Ala Asp Lys Lys Ile Arg Glu Cys
245 250 255

Asn Leu



(2) INFORMATION FOR SEQ ID NO: 151:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:151:

Ile Gln Arg Phe Gly Thr Ser Gly His Ile Met Asn Leu Gln Ala Gln
1 5 10 15

Pro Lys Ala Gln Asn Lys Arg Lys Arg Cys Leu Phe Gly Gly Gln Glu
20 25 30

Pro Ala Pro Lys Glu Gln Pro Pro Pro Leu Gln Pro Pro Gln Gln Ser
35 40 45

Ile Arg Val Lys Glu Glu Gln Tyr Leu Gly His Glu Gly Pro Gly Gly
50 55 60

Ala Val Ser Thr Ser Gln Pro Val Glu Leu Pro Pro Pro Ser Ser Leu
65 70 75

Ala Leu Leu Asn Ser Val Val Tyr Gly Pro Glu Arg Thr Ser Ala Ala
85 90 95

Met Leu Ser Gln Gln Val Ala Ser Val Lys Trp Pro Asn Ser Val Met
100 105 110

Ala Pro Gly Arg Gly Pro Glu Arg Gly Gly Gly Gly Gly Val Ser Asp
115 120 125

Ser Ser Trp Gln Gln Gln Pro Gly Gln Pro Pro Pro His Ser Thr Trp
130 135 140

Asn Cys His Ser Leu Ser Leu Tyr Ser Ala Thr Lys Gly Ser Pro His
145 150 155 160

Pro Gly Val Gly Val Pro Thr Tyr Tyr Asn His Pro Glu Ala Leu Lys
165 170 175

Arg Glu Lys Ala Gly Gly Pro Gln Leu Asp Arg Tyr Val Arg Pro Met
180 185 190

Met Pro Gln Lys Val Gln Leu Glu Val Gly Arg Pro Gln Ala Pro Leu
195 200 205

Asn Ser Phe His Ala Ala Lys Lys Pro Pro Asn Gln Ser Leu Pro Leu
210 215 220

Gln Pro Phe Gln Leu Ala Phe Gly His Gln Val Asn Arg Gln Val Phe
225 230 235

Arg Gln Gly Pro Pro Pro Pro Asn Pro Val Ala Ala Phe Pro Pro Gln
245 250 255

Lys Gln Gln Gln Gln Gln Gln Pro Gln Gln Gln Gln Gln Gln Gln Gln
260 265 270

Ala Ala Leu Pro Gln Met Pro Leu Phe Glu Asn Phe Tyr Ser Met Pro
275 280 285

Gln Gln Pro Ser Gln Gln Pro Gln Asp Phe Gly Leu Gln Pro Ala Gly
290 295 300

Pro Leu Gly Gln Ser His Leu Ala His His Ser Met Ala Pro Tyr Pro
305 310 315 320

Phe Pro Pro Asn Pro Asp Met Asn Pro Glu Leu Arg Lys Ala Leu Leu
325 330

Gln Asp Ser Ala Pro Gln Pro Ala Leu Pro Gln Val Gln Ile Pro Phe
340 345 350

Pro Arg Arg Ser Arg Arg Leu Ser Lys Glu Gly Ile Leu Pro Pro Ser
355 360 365

Ala Leu Asp Gly Ala Gly Thr Gln Pro Gly Gln Glu Ala Thr Gly Asn
370 375 380

Leu Phe Leu His His Trp Pro Leu Gln Gln Pro Pro Pro Gly Ser Leu
385 390 395

Gly Gln Pro His Pro Glu Ala Leu Gly Phe Pro Leu Glu Leu Arg Glu
405 410

Ser Gln Leu Leu Pro Asp Gly Glu Arg Leu Ala Pro Asn Gly Arg Glu
420 425 430

Arg Glu Ala Pro Ala Met Gly Ser Glu Glu Gly Met Arg Ala Val Ser
435 440 445

Thr Gly Asp Cys Gly Gln Val Leu Arg Gly Gly Val Ile Gln Ser Thr
450 455 460

Arg Arg Arg Arg Arg Ala Ser Gln Glu Ala Asn Leu Leu Thr Leu Ala
465 470

Gln Lys Ala Val Glu Leu Ala Ser Leu Gln Asn Ala Lys Asp Gly Ser
485 490 495

Gly Ser Glu Glu Lys Arg Lys Ser Val Leu Ala Ser Thr Thr Lys Cys
500 505 510

Gly Val Glu Phe Ser Glu Pro Ser Leu Ala Thr Lys Arg Ala Arg Glu
515 520 525

Asp Ser Gly Met Val Pro Leu Ile Ile Pro Val Ser Val Pro Val Arg
530 535 540

Thr Val Asp Pro Thr Glu Ala Ala Gln Ala Gly Gly Leu Asp Glu Asp
545 550 555

Gly Lys Gly Leu Glu Gln Asn Pro Ala Glu His Lys Pro Ser Val Ile
565 570 575

Val Thr Arg Arg Arg Ser Thr Arg Ile Pro Gly Thr Asp Ala Gln Ala
580 585 590

Gln Ala Glu Asp Met Asn Val Lys Leu Glu Gly Glu Pro Ser Val Arg
595 600 605

Lys Pro Lys Gln Arg Pro Arg Pro Glu Pro Leu Ile Ile Pro Thr Lys
610 615 620

Ala Gly Thr Phe Ile Ala Pro Pro Val Tyr Ser Asn Ile Thr Pro Tyr
625 630 635

Gln Ser His Leu Arg Ser Pro Val Arg Leu Ala Asp His Pro Ser Glu
645 650 655

Arg Ser Phe Glu Leu Pro Pro Tyr Thr Pro Pro Pro Ile Leu Ser Pro
660 665 670

Val Arg Glu Gly Ser Gly Leu Tyr Phe Asn Ala Ile Ile Ser Thr Ser
675 680 685

Thr Ile Pro Ala Pro Pro Pro Ile Thr Pro Lys Ser Ala His Arg Thr
690 695 700

Leu Leu Arg Thr Asn Ser Ala Glu Val Thr Pro Pro Val Leu Ser Val
705 710 715

Met Gly Glu Ala Thr Pro Val Ser Ile Glu Pro Arg Ile Asn Val Gly
725 730 735

Ser Arg Phe Gln Ala Glu Ile Pro Leu Met Arg Asp Arg Ala Leu Ala
740 745 750

Ala Ala Asp Pro His Lys Ala Asp Leu Val Trp Gln Pro Trp Glu Asp
755 760 765

Leu Glu Ser Ser Arg Glu Lys Gln Arg Gln Val Glu Asp Leu Leu Thr
770 775 780

Ala Ala Cys Ser Ser Ile Phe Pro Gly Ala Gly Thr Asn Gln Glu Leu
785 790 795

Ala Leu His Cys Leu His Glu Ser Arg Gly Asp Ile Leu Glu Thr Leu
805 810 815

Asn Lys Leu Leu Leu Lys Lys Pro Leu Arg Pro His Asn His Pro Leu
820 825 830

Ala Thr Tyr His Tyr Thr Gly Ser Asp Gln Trp Lys Met Ala Glu Arg
835 840 845

Lys Leu Phe Asn Lys Gly Ile Ala Ile Tyr Lys Lys Asp Phe Phe Leu
850 855 860

Val Gln Lys Leu Ile Gln Thr Lys Thr Val Ala Gln Cys Val Glu Phe
865 870 875

Tyr Tyr Thr Tyr Lys Lys Gln Val Lys Ile Gly Arg Asn Gly Thr Leu
885 890

Thr Phe Gly Asp Val Asp Thr Ser Asp Glu Lys Ser Ala Gln Glu Glu
900 905 910

Val Glu Val Asp Ile Lys Thr Ser Gln Lys Phe Pro Arg Val Pro Leu
915 920

Pro Arg Arg Glu Ser Pro Ser Glu Glu Arg Leu Glu Pro Lys Arg Glu
930 935 940

Val Lys Glu Pro Arg Lys Glu Gly Glu Glu Glu Val Pro Glu Ile Gln
945 950 955 960

Glu Lys Glu Glu Gln Glu Glu Gly Arg Glu Arg Ser Arg Arg Ala Ala
970 975



Ala Val Lys Ala Thr Gln Thr Leu Gln Ala Asn Glu Ser Ala Ser Asp
980 985 990

Ile Leu Ile Leu Arg Ser His Glu Ser Asn Ala Pro Gly Ser Ala Gly
995 1000 1005

Gly Gln Ala Ser Glu Lys Pro Arg Glu Gly Thr Gly Lys Ser Arg Arg
1010 1015 1020

Ala Leu Pro Phe Ser Glu Lys Lys Lys Lys Lys Gln Lys Ala
1025 1030 1035

(2) INFORMATION FOR SEQ ID NO: 152:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 849 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152:

Ile Arg His Glu Val Ser Phe Leu Trp Asn Thr Glu Ala Ala Cys Pro
1 5 10

Ile Gln Thr Thr Thr Asp Thr Asp Gln Ala Cys Ser Ile Arg Asp Pro

Asn Ser Gly Phe Val Phe Asn Leu Asn Pro Leu Asn Ser Ser Gln Gly
35 40 45

Tyr Asn Val Ser Gly Ile Gly Lys Ile Phe Met Phe Asn Val Cys Gly
50 55 60

Thr Met Pro Val Cys Gly Thr Ile Leu Gly Lys Pro Ala Ser Gly Cys
65 70 75 80

Glu Ala Glu Thr Gln Thr Glu Glu Leu Lys Asn Trp Lys Pro Ala Arg
85 90 95

Pro Val Gly Ile Glu Lys Ser Leu Gln Leu Ser Thr Glu Gly Phe Ile
100 105 110

Thr Leu Thr Tyr Lys Gly Pro Leu Ser Ala Lys Gly Thr Ala Asp Ala
115 120 125

Phe Ile Val Arg Phe Val Cys Asn Asp Asp Val Tyr Ser Gly Pro Leu
130 135 140

Lys Phe Leu His Gln Asp Ile Asp Ser Gly Gln Gly Ile Arg Asn Thr
145 150 155 160

Tyr Phe Glu Phe Glu Thr Ala Leu Ala Cys Val Pro Ser Pro Val Asp
165 170 175

Cys Gln Val Thr Asp Leu Ala Gly Asn Glu Tyr Asp Leu Thr Gly Leu
180 185 190

Ser Thr Val Arg Lys Pro Trp Thr Ala Val Asp Thr Ser Val Asp Gly
195 200 205

Arg Lys Arg Thr Phe Tyr Leu Ser Val Cys Asn Pro Leu Pro Tyr Ile
210 215 220

Pro Gly Cys Gln Gly Ser Ala Val Gly Ser Cys Leu Val Ser Glu Gly
225 230 235 240

Asn Ser Trp Asn Leu Gly Val Val Gln Met Ser Pro Gln Ala Ala Ala
245 250 255

Asn Gly Ser Leu Ser Ile Met Tyr Val Asn Gly Asp Lys Cys Gly Asn
260 265 270

Gln Arg Phe Ser Thr Arg Ile Thr Phe Glu Cys Ala Gln Ile Ser Gly
275 280 285

Ser Pro Ala Phe Gln Leu Gln Asp Gly Cys Glu Tyr Val Phe Ile Trp
290 295 300

Arg Thr Val Glu Ala Cys Pro Val Val Arg Val Glu Gly Asp Asn Cys
305 310 315

Glu Val Lys Asp Pro Arg His Gly Asn Leu Tyr Asp Leu Lys Pro Leu 

325 330 • 5a5 

Gly Leu Asn Asp Thr He Val Ser Ala Gly Glu Tyr Thr Tyr Tyr Phe 

340 345 55U 

Arg Val Cys Gly Lys Leu Ser Ser Asp Val Cys Pro Thr Ser Asp Lys 

355 360 365 

Ser Lys Val Val Ser Ser Cys Gln Glu Lys Arg Glu Pro Gln Gly Phe

370 375 380

His Lys Val Ala Gly Leu Leu Thr Gln Lys Leu Thr Tyr Glu Asn Gly
385 390 395 400

Leu Leu Lys Met Asn Phe Thr Gly Gly Asp Thr Cys His Lys Val Tyr

405 410 415

Gln Arg Ser Thr Ala Ile Phe Phe Tyr Cys Asp Arg Gly Thr Gln Arg

420 425 430

Pro Val Phe Leu Lys Glu Thr Ser Asp Cys Ser Tyr Leu Phe Glu Trp 

435 440 445 

Arg Thr Gln Tyr Ala Cys Pro Pro Phe Asp Leu Thr Glu Cys Ser Phe

450 455 460

Lys Asp Gly Ala Gly Asn Ser Phe Asp Leu Ser Ser Leu Ser Arg Tyr
465 470 475 480

Ser Asp Asn Trp Glu Ala Ile Thr Gly Thr Gly Asp Pro Glu His Tyr

485 490 495
Leu Ile Asn Val Cys Lys Ser Leu Ala Pro Gln Ala Gly Thr Glu Pro

500 505 510 

Cys Pro Pro Glu Ala Ala Ala Cys Leu Leu Gly Gly Ser Lys Pro Val 

515 520 525 

Asn Leu Gly Arg Val Arg Asp Gly Pro Gln Trp Arg Asp Gly Ile Ile

530 535 540

Val Leu Lys Tyr Val Asp Gly Asp Leu Cys Pro Asp Gly Ile Arg Lys
545 550 555 560

Lys Ser Thr Thr Ile Arg Phe Thr Cys Ser Glu Ser Gln Val Asn Ser

565 570 575

Arg Pro Met Phe Ile Ser Ala Val Glu Asp Cys Glu Tyr Thr Phe Ala

580 585 590

Trp Pro Thr Ala Thr Ala Cys Pro Met Lys Ser Asn Glu His Asp Asp

595 600 605

Cys Gln Val Thr Asn Pro Ser Thr Gly His Leu Phe Asp Leu Ser Ser

610 615 620 

Leu Ser Gly Arg Ala Gly Phe Thr Ala Ala Tyr Ser Glu Lys Gly Leu 






625 630 635 640

Val Tyr Met Ser Ile Cys Gly Glu Asn Glu Asn Cys Pro Pro Gly Val

645 650 655

Gly Ala Cys Phe Gly Gln Thr Arg Ile Ser Val Gly Lys Ala Asn Lys

660 665 670

Arg Leu Arg Tyr Val Asp Gln Val Leu Gln Leu Val Tyr Lys Asp Gly

675 680 685

Ser Pro Cys Pro Ser Lys Ser Gly Leu Ser Tyr Lys Ser Val Ile Ser

690 695 700

Phe Val Cys Arg Pro Glu Ala Gly Pro Thr Asn Arg Pro Met Leu Ile
705 710 715 720

Ser Leu Asp Lys Gln Thr Cys Thr Leu Phe Phe Ser Trp His Thr Pro

725 730 735

Leu Ala Cys Glu Gln Ala Thr Glu Cys Ser Val Arg Asn Gly Ser Ser

740 745 750

Ile Val Asp Leu Ser Pro Leu Ile His Arg Thr Gly Gly Tyr Glu Ala

755 760 765

Tyr Asp Glu Ser Glu Asp Asp Ala Ser Asp Thr Asn Pro Asp Phe Tyr

770 775 780

Ile Asn Ile Cys Gln Pro Leu Asn Pro Met His Gly Val Pro Cys Pro
785 790 795 800

Ala Gly Ala Ala Val Cys Lys Val Pro Ile Asp Gly Pro Pro Ile Asp

805 810 815

Ile Gly Arg Val Ala Gly Pro Pro Ile Leu Asn Pro Ile Ala Asn Glu

820 825 830

Ile Tyr Leu Asn Phe Glu Ser Ser Thr Pro Cys Gln Glu Phe Ser Cys

835 840 845

Lys


(2) INFORMATION FOR SEQ ID NO:153:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:153: 

Met Ala Arg Leu Ser Arg Pro Glu Arg Pro Asp Leu Val Phe Glu Glu

1 5 10 15

Glu Asp Leu Pro Tyr Glu Glu Glu Ile Met Arg Asn Gln Phe Ser Val

20 25 30

Lys Cys Trp Leu His Tyr Ile Glu Phe Lys Gln Gly Ala Pro Lys Pro

35 40 45

Arg Leu Asn Gln Leu Tyr Glu Arg Ala Leu Lys Leu Leu Pro Cys Ser

50 55 60

Tyr Lys Leu Trp Tyr Arg Tyr Leu Lys Ala Arg Arg Ala Gln Val Lys

65 70 75 80

His Arg Cys Val Thr Asp Pro Ala Tyr Glu Asp Val Asn Asn Cys His

85 90 95

Glu Arg Ala Phe Val Phe Met His Lys Met Pro Arg Leu Trp Leu Asp

100 105 110

Tyr Cys Gln Phe Leu Met Asp Gln Gly Arg Val Thr His Thr Arg Arg

115 120 125

Thr Phe Asp Arg Ala Leu Arg Ala Leu Pro Ile Thr Gln His Ser Arg

130 135 140
Ile Trp Pro Leu Tyr Leu Arg Phe Leu Arg Ser His Pro Leu Pro Glu
145 150 155 160

Thr Ala Val Arg Gly Tyr Arg Arg Phe Leu Lys Leu Ser Pro Glu Ser

165 170 175

Ala Glu Glu Tyr Ile Glu Tyr Leu Lys Ser Ser Asp Arg Leu Asp Glu

180 185 190

Ala Ala Gln Arg Leu Ala Thr Val Val Asn Asp Glu Arg Phe Val Ser

195 200 205

Lys Ala Gly Lys Ser Asn Tyr Gln Leu Trp His Glu Leu Cys Asp Leu

210 215 220

Ile Ser Gln Asn Pro Asp Lys Val Gln Ser Leu Asn Val Asp Ala Ile
225 230 235 240

Ile Arg Gly Gly Leu Thr Arg Phe Thr Asp Gln Leu Gly Lys Leu Trp
245 250 255
Cys Ser Leu Ala Asp Tyr Tyr Ile Arg Ser Gly His Phe Glu Lys Ala

260 265 270

Arg Asp Val Tyr Glu Glu Ala Ile Arg Thr Val Met Thr Val Arg Asp

275 280 285

Phe Thr Gln Val Phe Asp Ser Tyr Ala Gln Phe Glu Glu Ser Met Ile

290 295 300

Ala Ala Lys Met Glu Thr Ala Ser Glu Leu Gly Arg Glu Glu Glu Asp
305 310 315 320

Asp Val Asp Leu Glu Leu Arg Leu Ala Arg Phe Glu Gln Leu Ile Ser

325 330 335

Arg Arg Pro Leu Leu Leu Asn Ser Val Leu Leu Arg Gln Asn Pro His

340 345 350

His Val His Glu Trp His Lys Arg Val Ala Leu His Gln Gly Arg Pro

355 360 365

Arg Glu Ile Ile Asn Thr Tyr Thr Glu Ala Val Gln Thr Val Asp Pro

370 375 380

Phe Lys Ala Thr Gly Lys Pro His Thr Leu Trp Val Ala Phe Ala Lys
385 390 395 400

Phe Tyr Glu Asp Asn Gly Gln Leu Asp Asp Ala Arg Val Ile Leu Glu

405 410 415

Lys Ala Thr Lys Val Asn Phe Lys Gln Val Asp Asp Leu Ala Ser Val

420 425 430

Trp Cys Gln Cys Gly Glu Leu Glu Leu Arg His Glu Asn Tyr Asp Glu

435 440 445

Ala Leu Arg Leu Leu Arg Lys Ala Thr Ala Leu Pro Ala Arg Arg Ala

450 455 460

Glu Tyr Phe Asp Gly Ser Glu Pro Val Gln Asn Arg Val Tyr Lys Ser
465 470 475 480

Leu Lys Val Trp Ser Met Leu Ala Asp Leu Glu Glu Ser Leu Gly Thr

485 490 495

Phe Gln Ser Thr Lys Ala Val Tyr Asp Arg Ile Leu Asp Leu Arg Ile

500 505 510

Ala Thr Pro Gln Ile Val Ile Asn Tyr Ala Met Phe Leu Glu Glu His

515 520 525

Lys Tyr Phe Glu Glu Ser Phe Lys Ala Tyr Glu Arg Gly Ile Ser Leu

530 535 540

Phe Lys Trp Pro Asn Val Ser Asp Ile Trp Ser Thr Tyr Leu Thr Lys
545 550 555 560

Phe Ile Ala Arg Tyr Gly Gly Arg Lys Leu Glu Arg Ala Arg Asp Leu

565 570 575

Phe Glu Gln Ala Leu Asp Gly Cys Pro Pro Lys Tyr Ala Lys Thr Leu

580 585 590

Tyr Leu Leu Tyr Ala Gln Leu Glu Glu Glu Trp Gly Leu Ala Arg His

595 600 605

Ala Met Ala Val Tyr Glu Arg Ala Thr Arg Ala Val Glu Pro Ala Gln

610 615 620

Gln Tyr Asp Met Phe Asn Ile Tyr Ile Lys Arg Ala Ala Glu Ile Tyr
625 630 635 640

Gly Val Thr His Thr Arg Gly Ile Tyr Gln Lys Ala Ile Glu Val Leu

645 650 655

Ser Asp Glu His Ala Arg Glu Met Cys Leu Arg Phe Ala Asp Met Glu

660 665 670

Cys Lys Leu Gly Glu Ile Asp Arg Ala Arg Ala Ile Tyr Ser Phe Cys

675 680 685

Ser Gln Ile Cys Asp Pro Arg Thr Thr Gly Ala Phe Trp Gln Thr Trp

690 695 700

Lys Asp Phe Glu Val Arg His Gly Asn Glu Asp Thr Ile Lys Glu Met
705 710 715 720

Leu Arg Ile Arg Arg Ser Val Gln Ala Thr Tyr Asn Thr Gln Val Asn

725 730 735

Phe Met Ala Ser Gln Met Leu Lys Val Ser Gly Ser Ala Thr Gly Thr

740 745 750

Val Ser Asp Leu Ala Pro Gly Gln Ser Gly Met Asp Asp Met Lys Leu

755 760 765

Leu Glu Gln Arg Ala Glu Gln Leu Ala Ala Glu Ala Glu Arg Asp Gln

770 775 780

Pro Leu Arg Ala Gln Ser Lys Ile Leu Phe Val Arg Ser Asp Ala Ser
785 790 795 800

Arg Glu Glu Leu Ala Glu Leu Ala Gln Gln Val Asn Pro Glu Glu Ile

805 810 815

Gln Leu Gly Glu Asp Glu Asp Glu Asp Glu Met Asp Leu Glu Pro Asn

820 825 830

Glu Val Arg Leu Glu Gln Gln Ser Val Pro Ala Ala Val Phe Gly Ser

835 840 845

Leu Lys Glu Asp
850

(2) INFORMATION FOR SEQ ID NO:154:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 693 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:154:

Met Phe Ser Ala Leu Lys Lys Leu Val Gly Ser Asp Gln Ala Pro Gly

1 5 10 15

Arg Asp Lys Asn Ile Pro Ala Gly Leu Gln Ser Met Asn Gln Ala Leu

20 25 30

Gln Arg Arg Phe Ala Lys Gly Val Gln Tyr Asn Met Lys Ile Val Ile

35 40 45

Arg Gly Asp Arg Asn Thr Gly Lys Thr Ala Leu Trp His Arg Leu Gln

50 55 60
Gly Arg Pro Phe Val Glu Glu Tyr Ile Pro Thr Gln Glu Ile Gln Val
65 70 75 80

Thr Ser Ile His Trp Ser Tyr Lys Thr Thr Asp Asp Ile Val Lys Val

85 90 95 

Glu Val Trp Asp Val Val Asp Lys Gly Lys Cys Lys Lys Arg Gly Asp 

100 105 110

Gly Leu Lys Met Glu Asn Asp Pro Gln Glu Xaa Glu Ser Glu Met Ala

115 120 125

Leu Asp Ala Glu Phe Leu Asp Val Tyr Lys Asn Cys Asn Gly Val Val

130 135 140

Met Met Phe Asp Ile Thr Lys Gln Trp Thr Phe Asn Tyr Ile Leu Arg
145 150 155 160

Glu Leu Pro Lys Val Pro Thr His Val Pro Val Cys Val Leu Gly Asn

165 170 175
Tyr Arg Asp Met Gly Glu His Arg Val Ile Leu Pro Asp Asp Val Arg

180 185 190

Asp Phe Ile Asp Asn Leu Asp Arg Pro Pro Gly Ser Ser Tyr Phe Arg

195 200 205 

Tyr Ala Glu Ser Ser Met Lys Asn Ser Phe Gly Leu Lys Tyr Leu His 

210 215 220 

Lys Phe Phe Asn Ile Pro Phe Leu Gln Leu Gln Arg Glu Thr Leu Leu
225 230 235 240

Arg Gln Leu Glu Thr Asn Gln Leu Asp Met Asp Ala Thr Leu Glu Glu

245 250 255
Leu Ser Val Gln Gln Glu Thr Glu Asp Gln Asn Tyr Gly Ile Phe Leu

260 265 270 

Glu Met Met Glu Ala Arg Ser Arg Gly His Ala Ser Pro Leu Ala Ala 

275 280 285 

Asn Gly Gln Ser Pro Ser Pro Gly Ser Gln Ser Pro Val Leu Pro Ala

290 295 300

Pro Ala Val Ser Thr Gly Ser Ser Ser Pro Gly Thr Pro Gln Pro Ala
305 310 315 320

Pro Gln Leu Pro Leu Asn Ala Ala Pro Pro Ser Ser Val Pro Pro Val

325 330 335 

Pro Pro Ser Glu Ala Leu Pro Pro Pro Ala Cys Pro Ser Ala Pro Ala 

340 345 350 

Pro Arg Arg Ser Ile Ile Ser Arg Leu Phe Gly Thr Ser Pro Ala Thr

355 360 365

Glu Ala Ala Pro Pro Pro Pro Glu Pro Val Pro Ala Ala Gln Gly Pro

370 375 380

Ala Thr Val Gln Ser Val Glu Asp Phe Val Pro Asp Asp Arg Leu Asp

385 390 395 400

Arg Ser Phe Leu Glu Asp Thr Thr Pro Ala Arg Asp Glu Lys Lys Val

405 410 415
Gly Ala Lys Ala Ala Gln Gln Asp Ser Asp Ser Asp Gly Glu Ala Leu

420 425 430

Gly Gly Asn Pro Met Val Ala Gly Phe Gln Asp Asp Val Asp Leu Glu

435 440 445

Asp Gln Pro Arg Gly Ser Pro Pro Leu Pro Ala Gly Pro Val Pro Ser
450 455 460
Gln Asp Ile Thr Leu Ser Ser Glu Glu Glu Ala Glu Val Ala Ala Pro
465 470 475 480

Thr Lys Gly Pro Ala Pro Ala Pro Gln Gln Cys Ser Glu Pro Glu Thr

485 490 495

Lys Trp Ser Ser Ile Pro Ala Ser Lys Pro Arg Arg Gly Thr Ala Pro

500 505 510

Thr Arg Thr Ala Ala Pro Pro Trp Pro Gly Gly Val Ser Val Arg Thr

515 520 525

Gly Pro Glu Lys Arg Ser Ser Thr Arg Pro Pro Ala Glu Met Glu Pro

530 535 540

Gly Lys Gly Glu Gln Ala Ser Ser Ser Glu Ser Asp Pro Glu Gly Pro
545 550 555 560

Ile Ala Ala Gln Met Leu Ser Phe Val Met Asp Asp Pro Asp Phe Glu

565 570 575
Ser Glu Gly Ser Asp Thr Gln Arg Arg Ala Asp Asp Phe Pro Val Arg

580 585 590

Asp Asp Pro Ser Asp Val Thr Asp Glu Asp Glu Gly Pro Ala Glu Pro 

595 600 605 

Pro Pro Pro Pro Lys Leu Pro Leu Pro Ala Phe Arg Leu Lys Asn Asp 

610 615 620 

Ser Asp Leu Phe Gly Leu Gly Leu Glu Glu Ala Gly Pro Lys Glu Ser 
625 630 635 640

Ser Glu Glu Gly Lys Glu Gly Lys Thr Pro Ser Lys Glu Lys Lys Lys

645 650 655

Lys Thr Lys Ser Phe Ser Arg Val Leu Leu Glu Arg Pro Arg Ala His

660 665 670

Arg Phe Ser Thr Arg Val Gly Tyr Gln Val Ser Val Pro Asn Ser Pro

675 680 685 

Tyr Ser Glu Ser Tyr 
690 


